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NOGO RECEPTOR HOMOLOGS 



FIELD OF THE INVENTION 

5 The invention relates to neurology and molecular biology. More particularly, 

the invention relates to CNS neurons and axonal growth 

BACKGROUND 

Among the mechanisms through which the cells of an organism communicate 

10 with each other and obtain information and stimuli from their environment is through 
cell membrane receptor molecules expressed on the cell surface. Many such receptors 
have been identified, characterized, and sometimes classified into major receptor 
superfamilies based on structural motifs and signal transduction features. The receptors 
are a first essential link for translating an extracellular signal into a cellular 

15 physiological response. 

Receptors on neurons are particularly important in the development of the 
nervous system during embryogenesis. The neurons form connections with target cells 
during development through axonal extension of the neurons toward the target cells in 
a receptor-mediated process. Axons and dendrites have a specialized region of their 

20 distal tips known as the growth cone. Growth cones enable the neuron to sense the 

local environment through a receptor-mediated process and direct the movement of the 
axon or dendrite of the neuron toward the neuron's target cell. This process is known 
as elongation. Growth cones can be sensitive to several guidance cues, for example, 
surface adhesiveness, growth factors, neurotransmitters and electric fields. The 

25 guidance of growth at the cone depends on various classes of adhesion molecules, 
intercellular signals, as well as factors that stimulate and inhibit growth cones. 

Interestingly, damaged neurons do not elongate in the central nervous system 
(CNS) following injury due to trauma or disease, whereas axons in the peripheral 
nervous system (PNS) regenerate readily. The fact that damaged CNS neurons fail to 

30 elongate is not due to an intrinsic property of CNS axons, but rather due to the CNS 
environment that is not permissive for axonal elongation. Classical grafting 
experiments by Aguayo and colleagues {e.g., Richardson et al, (1980) Nature 284, 
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264-265) demonstrated that CNS axons can in fact elongate over substantial distances 
within peripheral nerve grafts, and that CNS myelin inhibits CNS axon elongation. 
Therefore, given the appropriate environment, CNS axons can regenerate, implying 
that CNS axonal injury can potentially be addressed by appropriate manipulation of the 
5 CNS environment. 

The absence of axon regeneration following injury can be attributed to the 
presence of axon growth inhibitors. These inhibitors are predominantly associated 
with myelin and constitute an important barrier to regeneration. Axon growth 
inhibitors are present in CNS-derived myelin and the plasma membrane of 

10 oligodendrocytes that synthesize myelin in the CNS (Schwab et al , (1993) Annu. Rev. 
Neurosci. 16, 565-595).* Myelin-associated inhibitors appear to be a primary 
contributor to the failure of CNS axon regeneration in vivo after an interruption of 
axonal continuity, whereas other non-myelin associated axon growth inhibitors in the 
CNS may play a lesser role. These inhibitors block axonal regeneration following 

15 neuronal injury due to trauma, stroke or viral infection. 

Numerous myelin-derived axon growth inhibitors have been characterized (see, 
for review, David et al, (1999) W0995394547; Bandman et al, (1999) U.S. Patent 
No. 5,858,708; Schwab, (1996) Neurochem. Res. 21, 755-761). Several components 
of CNS white matter, NI35, NI250 (Nogo) and Myelin-associated glycoprotein 

20 (MAG), which have inhibitory activity for axonal extension, have been described as 
well (Schwab et al., (1990) WO9005191; Schwab etal 9 (1997) U.S. Patent No. 
5,684,133). In particular, Nogo is a 250 kDa myelin-associated axon growth inhibitor 
that was originally characterized based on the eflfects of the purified protein in vitro 
and monoclonal antibodies that neutralize the protein's activity (Schwab (1990) Exp. 

25 Neurol 109, 2-5). The Nogo cDNA was first identified through random analysis of 
brain cDNA and had no suggested function (Nagase et al, (1998) DNA Res. 5, 
355-364). The identification of this Nogo cDNA as the cDNA encoding the 250 kDa 
myelin-associated axon growth inhibitor was discovered only recently (GrandPre et al, 
(2000) Nature 403, 439-444; Chen et al, (2000) Nature 403, 434-439; Prinjha at al, 

30 (2000) Nature 403, 383-384). 

Importantly, Nogo has been shown to be the primary component of CNS 
myelin responsible for inhibiting axonal elongation and regeneration. Nogo's selective 
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expression by oligodendrocytes and not by Schwann cells (the cells that myelinate P.S. 
axons) is consistent with the inhibitory effects of CNS myelin, in contrast to P.S. 
myelin (GrandPre et aL, (2000) Nature 403, 434-439). In culture, Nogo inhibits 
axonal elongation and causes growth cone collapse (Spillmann et aL, (1998) J. Biol. 
5 Chem. 272, 19283-19293). Antibodies (e.g., IN-1) against Nogo have been shown to 
block most of the inhibitory action of CNS myelin on neurite growth in vitro 
(Spillmann et aL, (\99$)J.BioL Chem. 272:19283-19293). These experiments 
indicate that Nogo is the main component of CNS myelin responsible for inhibition of 
axonal elongation in culture. Furthermore, in vivo, the IN-1 antibody has been shown 

10 to enhance axonal regeneration after spinal cord injury, resulting in recovery of 

behaviors such as contact placing and stride length (Schnell and Schwab (1990) Nature 
343, 269-272; Bregman et aL, (1995) Nature 378, 498-501). Thus, there is 
substantial evidence that Nogo is a disease-relevant molecular target. Agents that 
interfere with the binding of Nogo to its receptor would be expected to improve axonal 

15 regeneration in clinical states in which axons have been damaged, and improve patient 
outcome. 

Modulation of Nogo has been described as a means for treatment of 
regeneration for neurons damaged by trauma, infarction and degenerative disorders of 
the CNS (Schwab et aL, (1994) W09417831; Tatagiba et aL, (1997) Neurosurgery 
20 40, 541-546) as well as malignant tumors in the CNS such as glioblastoma (Schwab et 
aL, (1993) U.S. Patent No. 5,250,414); Schwab et aL, (2000) U.S. Patent No. 
6,025,333). 

Antibodies which recognize Nogo have been suggested to be useful in the 
diagnosis and treatment of nerve damage resulting from trauma, infarction and 

25 degenerative disorders of the CNS (Schnell & Schwab, (1990) Nature 343, 269-272; 
Schwab et aL, (1997) U.S. Patent No. 5,684,133). For CNS axons, there is a 
correlation between the presence of myelin and the inhibition of axon regeneration 
over long distances (Savio and Schwab (1990) Proc. Natl. Acad Set 87, 4130-4133; 
Keirstead etaL, (1992) Proc. Natl. Acad Sci. 89, 11664-11668). After Nogo is 

30 blocked by antibodies, neurons can again extend across lesions caused by nerve 
damage (Schnell and Schwab (1990) Nature 343, 269-272). 
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SUMMARY OF THE INVENTION 

Genes encoding homologs (NgR2 and NgR3) of a Nogo receptor (NgRl) in 
mice and humans have been discovered. Various domains in the polypeptides encoded 
by the NgR2 and NgR3 genes have been identified and compared to domains in mouse 
5 and human NgRl polypeptides. This comparison has led to identification of a 

consensus sequence (NgR consensus sequence) that characterizes a family of proteins 
(NgR family). Based on these and other discoveries, the invention features molecules 
and methods for modulating axonal growth in CNS neurons. 

The invention provides a polypeptide that contains a polypeptide containing a 
10 tryptophan rich LRRCT domain consisting of the amino acid sequence: 

NX 1 WX 2 CX 3 CRARX 4 LWX 5 WX 6 X 7 X 8 X 9 RX 10 SSSX 11 V 
X 12 C X 13 X 14 P X 15 X 16 X n X 18 X 19 X 20 D L X 21 X^ L X^ X 24 X 25 D 

15 

X^X^X.gCtSEQIDNO.'lP] 



wherein X is any protein amino acid or a gap, and the polypeptide does not 
include amino acid sequence from residue 260 to 309 of SEQ ID NO: 5 
20 (human NgRl) or SEQ ID NO: 17 (mouse NgRl). 



Preferably, XI 7 and X23 are (independently) arginine or lysine. In some 
embodiments, the amino acid sequence of the LRRCT domain is residues 261-3 10 of 
SEQ ID NO:2, or residues 261-3 10 of SEQ ID NO: 2 with up to 10 conservative 
25 amino acid substitutions. In some embodiments, the polypeptide contains the 
following NTLRRCT amino acid sequence: 

C P X x X 2 C X 3 C Y X 4 X 5 P Xg X 7 T X 8 S C X^ X 10 X u X 12 X 13 X 14 X 15 P 
X 17 X 18 X 19 P X 20 X 21 X 22 X^ R X 24 F L X 25 X 26 N X 27 1 X 28 X 29 X 30 X 31 X 32 X 33 
X 34 F X 35 X 36 X 37 X 38 X 39 X 40 X 41 X 42 L W X 43 X^ S N X 45 X^ X4 7 X 48 1 X 49 
30 X 50 X 51 X 52 FX 53 X 54 X 55 X 56 X 57 LEX 58 LDLX 59 DNX 60 X 61 LX 62 X 63 X 64 

Xgs P Xgg T F X 67 G L X 68 X 69 L X 70 X 71 L X 72 L X 73 X 74 C X 75 L X 76 X 77 L X 78 
X 7J X l0 X >1 FX E GLX„X M LQYLYLQX ls NX 8{ X n X M X B9 LX 91) D 
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X91 X92 F X93 D L X94 N L X95 HLFLHGN X95 X97 X^ X99 Xjq 0 X 10 i X 102 
X103 -^104 F R G L X 105 X 106 LDRLLLH X 107 N X 108 X 109 X uo X 1U V H X 112 
X U3 A F X 114 X U5 L X n6 R L X n7 X U8 L X U9 L F X 12 o N X 12 i L X 122 X 12 3 L 
-^124 X 125 X 126 X 127 L X 128 X 129 L X 130 X 131 L X 132 X 133 L R L N X 134 N X 135 W 
X 136 C X 137 C R X 138 R X 139 L W X 140 W X 141 X 142 X M3 X 144 R X 145 S S S X 146 
V X 147 C X 148 X 149 P X 150 X 1S1 X 152 X 1S3 X 154 X 155 D L X 156 X 157 L X 158 X 159 X 160 
D X 161 X 162 X 163 C [SEQ ID NO:18] 



wherein X is any amino acid residue or a gap and wherein the polypeptide is not the 

10 polypeptide of SEQ ID NO: 5 (human NgRl) or SEQ ID NO: 17 (mouse NgRl). For 
example, X^ X 37 and X 3g may represent a gap. Specific examples of polypeptides of 
the invention are SEQ ID NO: 2 (human NgR2), SEQ ID NO: 4 (mouse NgR3), and 
SEQ ID NO: 14 (human NgR3). In some embodiments, the polypeptide contains: (a) 
a NTLRRCT domain, and (b) less than a complete CTS domain, provided that a partial 

15 CTS domain, if present, consists of no more than the first 39 amino acids of the CTS 
domain. While the polypeptide may contain a functional GPI domain, a functional GPI 
domain may be absent, e.g., when a soluble polypeptide is desired. A polypeptide of 
the invention optionally includes an amino acid sequence of a heterologous 
polypeptide, e.g., an Fc portion of an antibody. 

20 The invention also provides a nucleic acid encoding an above-described 

polypeptide; a vector containing the nucleic acid, which nucleic acid may be operably 
linked to an expression control sequence; and a transformed host cell containing the 
vector. A method of producing a polypeptide of the invention is also provided. The 
method includes introducing a nucleic acid encoding the above-described polypeptide 

25 into a host cell, culturing the cell under conditions suitable for expression of the 
polypeptide, and recovering the polypeptide. 

The invention also provides an antisense molecule whose nucleotide sequence 
is complementary to a nucleotide sequence encoding a polypeptide selected from the 
group consisting of: a polypeptide consisting of residues 3 1 1-395 of SEQ ID NO: 2, a 

30 polypeptide consisting of residues 256-396 of SEQ ID NO: 14 and a polypeptide 

consisting of residues 321-438 of SEQ ID NO: 4, wherein the nucleic acid is from 8 to 
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100 nucleotides in length, e.g., about 20, 30, 40, 50, 60, 70, 80 or 90 nucleotides. The 
invention also provides a nucleic acid encoding such an antisense molecule. 

The invention also provides an antibody that binds to an above-described 
polypeptide. Polypeptides or antibodies of the invention can be formulated into 
5 pharmaceutical compositions containing the polypeptide or antibody and a 
pharmaceutical^ acceptable carrier. 

The invention also provides a method for decreasing inhibition of axonal 
growth of a CNS neuron. The method includes the step of contacting the neuron with 
an effective amount of a polypeptide or antibody of the invention. 

10 The invention also provides a method for treating a central nervous system disease, 
disorder or injury. The method includes administering to a mammal, e.g., a human, an 
effective amount of a polypeptide or antibody of the invention. Exemplary diseases, 
disorders and injuries that may be treated using molecules and methods of the 
invention include, but are not limited to, cerebral injury, spinal cord injury, stroke, 

15 demyelinating diseases, e.g., multiple sclerosis, monophasic demyelination, 
encephalomyelitis, multifocal leukoencephalopathy, panencephalitis, 
Marchiafava-Bignami disease, Spongy degeneration, Alexander's disease, Canavan's 
disease, metachromatic leukodystrophy and Krabbe's disease. 

The invention also provides a method for identifying a molecule that binds a 

20 * polypeptide of the invention. The method includes the steps of: (a) providing a 
polypeptide of the invention; (b) contacting the polypeptide with the candidate 
molecule; and (c) detecting binding of the candidate molecule to the polypeptide. 

Unless otherwise defined, all technical and scientific terms used herein have the 
same meaning as commonly understood by one of ordinary skill in the art to which the 

25 invention belongs. In case of conflict, the present application, including definitions, 
will control. All publications, patent and other references mentioned herein are 
incorporated by reference. 

The materials, methods and examples presented below are illustrative only, and 
not intended to be limiting. Other features and advantages of the invention will be 

30 apparent from the detail description and from the claims. 



BRIEF DESCRIPTION OF THE FIGURES 
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Fig. 1 A-1B shows an alignment of NgR2 (SEQ ID NO:2) and NgR3 (SEQ ID 
NO:4) with the known NgR, NgRl (SEQ DD NO: 5) and the Consensus Sequence 
(SEQIDNO:6). 

Fig. 2. niNgR3 does not bind hNogoA(1055-l 120). COS-7 cells were 
transfected with vectors encoding myc-NgRl or myc-NgR3, fixed, and stained with 
anti-myc antibodies or AP-hNogoA(1055-l 120). 

Fig.3. An alignment of the amino acid sequences of human NgRl, murine 
NgRl, murine NgR3, human NgR3 and human NgR2. Numbering begins with amino 
acid #1 of murine NgR3. The consensus sequence is listed below. The LRR NT 
domain is indicated by a shaded box; domains LLR 1, LLR 3, LLR 5, and LLR 7 are 
indicated by open boxes; LLR 2, LLR 4, LLR 6 and LLR 8 are indicated by shaded 
boxes; and the LLR CT domain is indicated by a shaded box. Amino acids in bold in 
LLR 8 indicate a conserved glycosylation sites. A dot indicates conserved cystine 
residue in LRR4. Box at C terminus indicates putative GPI signals. 

DETAILED DESCRIPTION OF THE INVENTION 

The present invention provides purified and isolated polynucleotides (e.g., 
DNA sequences and RNA transcripts, both sense and complementary antisense 
strands, both single- and double-stranded, including splice variants thereof) encoding 
NgR homologs, referred to herein as NgR Unless indicated otherwise, as used herein, 
the abbreviation in lower case (NgR) refers to a gene, cDNA, RNA or nucleic acid 
sequence, whereas the upper case version (NgR) refers to a protein, polypeptide, 
peptide, oligopeptide, or amino acid sequence. Specific proteins are designated by 
number, e.g., "NgR2" is a human NgR homolog, "NgR3" is a murine-derived NgR 
homolog, and "NgRl" is the known NgR identified by Dr. Stephen Strittmatter. 
Known NgRs are herein referred to as "NgRs." DNA polynucleotides of the invention 
include genomic DNA, cDNA and DNA that has been chemically synthesized in whole 
or in part. 
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Standard reference works setting forth the general principles of recombinant 
DNA technology known to those of skill in the art include Ausubel et aL, CURRENT 
Protocols in Molecular Biology, John Wiley & Sons, New York (1998); 
Sambrook et al, MOLECULAR CLONING: A LABORATORY MANUAL, 2d Ed., Cold 
5 Spring Harbor Laboratory Press, Plainview, New York (1989); Kaufinan et al 3 Eds., 
Handbook of Molecular and Cellular Methods in Biology and Medicine, 
CRC Press, Boca Raton (1995); McPherson, Ed., DIRECTED MUTAGENESIS: A 
Practical Approach,. IRL Press, Oxford (1991). 

As used herein, the term "axon" refers to a long cellular protrusion from a 
10 neuron, whereby action potentials are conducted, either to or from the cell body. 

As used herein, the term "axonal growth" refers to an extension of the long 
process or axon, originating at the cell body and proceeded by the growth cone. 

As used herein, the term "central nervous system disorder" refers to any 
pathological state associated with abnormal function of the central nervous system 
15 (CNS). The term includes, but is not limited to, altered CNS function resulting from 
physical trauma to cerebral tissue, viral infection, autoimmune machanisms and genetic 
mutation. 

As used herein, the term "demyelinating disease" refers to a pathological 
disorder characterized by the degradation of the myelin sheath of the oligodendrocyte 
20 cell membrane. 

As used herein, the term "growth cone" refers to a specialized region at the tip 
of a growing neurite that is responsible for sensing the local environment and moving 
the axon toward its appropriate synaptic target cell. 

As used herein, the term "growth cone movement" refers to the extension or 
25 collapse of the growth cone toward a neuron's target cell. 

As used herein, the term "neurite" refers to a process growing out of a neuron. 
As it is sometimes difficult to distinguish a dendrite from in axon in culture, the term 
"neurite" is used for both. 

As used herein, the term "oligodendrocyte" refers to a neuroglial cell of the 
30 CNS whose function is to myelinate CNS axons. 

"Synthesized" as used herein and understood in the art, refers to 
polynucleotides produced by purely chemical, as opposed to enzymatic, methods. 
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"Wholly" synthesized DNA sequences are therefore produced entirely by chemical 
means, and "partially" synthesized DNAs embrace those wherein only portions of the 
resulting DNA were produced by chemical means. By the term "region" is meant a 
physically contiguous portion of the primary structure of a biomolecule. In the case of 
5 proteins, a region is defined by a contiguous portion of the amino acid sequence of that 
protein. The term "domain" is herein defined as referring to a structural part of a 
biomolecule that contributes to a known or suspected function of the biomolecule. 
Domains may be co-extensive with regions or portions thereof; domains may also 
incorporate a portion of a biomolecule that is distinct from a particular region, in 

10 addition to all or part of that region. Examples of NgR protein domains include, but 
are not limited to, the signal peptide, extracellular (z'.e., N-terminal) domain, and 
leucine-rich repeat domains. 

As used herein, the term "activity" refers to a variety of measurable indicia 
suggesting or revealing binding, either direct or indirect; affecting a response, i.e., 

15 having a measurable affect in response to some exposure or stimulus, including, for 
example, the affinity of a compound for directly binding a polypeptide or 
polynucleotide of the invention, or, for example, measurement of amounts of upstream 
or downstream proteins or other similar functions after some stimulus or event. Such 
activities may be measured by assays such as competitive inhibition of NgRl binding to 

20 Nogo assays wherein, for example, unlabeled, soluble Ngjl2 is added to an assay 

system in increasing concentrations to inhibit the binding of Nogo to NgRl expressed 
on the surface of CHO cells. As another example, one may assess the ability of 
neurons to extend across lesions caused by nerve damage (as in Schnell and Schwab 
(1990) Nature 343, 269-272) following inhibition of Nogo by various forms of NgR2 

25 and/or NgR3 as a biological indicator of NgR function. 

As used herein, the term "antibody" is meant to refer to complete, intact 
antibodies, and Fab, Fab 1 , F(ab)2, and other fragments thereof. Complete, intact 
antibodies include monoclonal antibodies such as murine monoclonal antibodies, 
chimeric antibodies, anti-idiotypic antibodies, anti-anti-idiotypic antibodies, and 

30 humanized antibodies. 

As used herein, the term "binding" means the physical or chemical interaction 
between two proteins or compounds or associated proteins or compounds or 
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combinations thereof. Binding includes ionic, non-ionic, hydrogen bonds, Van der 
Waals, hydrophobic interactions, etc. The physical interaction, the binding, can be 
either direct or indirect, indirect being through or due to the effects of another protein 
or compound. Direct binding refers to interactions that do not take place through or 
5 due to the effect of another protein or compound but instead are without other 
substantial chemical intermediates. 

As used herein, the term "compound" means any identifiable chemical or 
molecule, including, but not limited to, small molecules, peptides, proteins, sugars, 
nucleotides or nucleic acids, and such compound can be natural or synthetic. 

10 As used herein, the term "complementary" refers to Watson-Crick basepairing 

between nucleotide units of a nucleic acid molecule. 

As used herein, the term "contacting" means bringing together, either directly 
or indirectly, a compound into physical proximity to a polypeptide or polynucleotide of 
the invention. The polypeptide or polynucleotide can be in any number of buffers, 

15 salts, solutions etc. Contacting includes, for example, placing the compound into a 
beaker, microtiter plate, cell culture flask, or a microarray, such as a gene chip, or the 
like, which contains the nucleic acid molecule, or polypeptide encoding the NgR or 
fragment thereof. 

As used herein, the phrase "homologous nucleotide sequence," or "homologous 
20 amino acid sequence," or variations thereof, refers to sequences characterized by an 
identity at the nucleotide level, or a homology at the amino acid level, of at least the 
specified percentage. Homologous nucleotide sequences include those sequences 
coding for isoforms of proteins. Such isoforms can be expressed in different tissues of 
the same organism as a result of, for example, alternative splicing of RNA. 
25 Alternatively, isoforms can be encoded by different genes, Homologous nucleotide 
sequences include nucleotide sequences encoding for a protein of a species other than 
humans, including, but not limited to, mammals. Homologous nucleotide sequences 
also include, but are not limited to, naturally occurring allelic variations and mutations 
of the nucleotide sequences set forth herein. A homologous nucleotide sequence does 
30 not, however, include the nucleotide sequence encoding NgRl . Homologous amino 
acid sequences include those amino acid sequences which contain conservative amino 
acid substitutions and which polypeptides have the same binding and/or activity. A 
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homologous amino acid sequence does not, however, include the amino acid sequence 
encoding other known NgRs. Percent homology can be determined by, for example, 
the Gap program (Wisconsin Sequence Analysis Package, Version 8 for Unix, 
Genetics Computer Group, University Research Park, Madison WI), using the default 
5 settings, which uses the algorithm of Smith and Waterman (Adv. Appl. Math., 1981, 2, 
482-489, which is incorporated herein by reference in its entirety). 

As used herein, the term "isolated" nucleic acid molecule refers to a nucleic 
acid molecule (DNA or RNA) that is substantially free of nucleic acids encoding other 
proteins with which it is associated in nature, i.e., a nucleic acid that has been removed 

10 from its native environment. Examples of isolated nucleic acid molecules include, but 
are not limited to, recombinant DNA molecules contained in a vector, recombinant 
DNA molecules maintained in a heterologous host cell, partially or substantially 
purified nucleic acid molecules, and synthetic DNA or RNA molecules. Preferably, an 
"isolated" nucleic acid is free of sequences which naturally flank the nucleic acid (i.e., 

1 5 sequences located at the 5* and 3 1 ends of the nucleic acid) in the genomic DNA of the 
organism from which the nucleic acid is derived. For example, in various 
embodiments, the isolated NgR nucleic acid molecule can contain less than about 50 
kb, 25 kb, 5 kb, 4 kb, 3 kb, 2 kb, 1 kb, 0.5 kb or 0.1 kb of nucleotide sequences which 
naturally flank the nucleic acid molecule in genomic DNA of the cell from which the 

20 nucleic acid is derived. Moreover, an "isolated" nucleic acid molecule, such as a 

cDNA molecule, can be substantially free of other cellular material or culture medium 
when produced by recombinant techniques, or of chemical precursors or other 
chemicals when chemically synthesized. 

As used herein, the term "heterologous" refers to a nucleotide or amino acid 

25 sequence that is a different, or non-corresponding sequence, or a sequence derived 

from a different species. For example, a mouse NgR nucleotide or amino acid sequence 
is heterologous to a human NgR nucleotide or amino acid sequence, and a human NgR 
nucleic or amino acid sequence is heterologous to a human immunoglobulin nucleotide 
or amino acid sequence. 

30 As used herein, a "soluble NgR polypeptide" is a NgR polypeptide that does 

not anchor itself in a membrane. Such soluble polypeptides include, for example, 
NgR2 and NgJR3 polypeptides that lack a sufficient portion of their GPI anchor signal 
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to anchor the polypeptide or are modified such that the GPI anchor signal is not 
adequate to result in replacement of the peptide with a GPI anchor. In preferred 
embodiments, up to 5, 10, 20 or 25 amino acids are removed from the C-terminus of 
NgR2 or NgR3 to make the respective proteins soluble. As used herein soluble NgR 
5 polypeptides include full-length or truncated (e.g. , with internal deletions) NgR. 

Soluble NgR polypeptides may include the entire NgR protein up to the 
putative GPI signal sequence (e.g., amino acid 1 to about amino acid 395 of NgR2, 
and from amino acid 1 to about amino acid 438 of NgR3). In other embodiments, the 
signal peptide of the proteins may be removed or truncated (e.g., all or part of the 
10 signal sequence of NgR2, which spans amino acid 1 to about amino acid 30 of SEQ ID 
NO:2, may be removed; all or part of the signal sequence of NgR3, which spans amino 
acid 1 to about amino acid 40 of SEQ ID NO:4, may be removed). In some 
embodiments, the mature NgR2 (SEQ ID NO:8) and the mature NgR3 (SEQ ID 
NO:9) are used. 

1 5 Soluble NgR polypeptides include at least one of the putative ligand-binding 

portions of NgR, including the first cysteine-rich region (SEQ ID NO: 10, the leucine 
repeat region (SEQ ID NO: 12) and the second cysteine-rich region (SEQ ID NO: 1 1). 
In some embodiments, soluble NgR polypeptides consist of amino acid 1 through 
about amino acid 395 of SEQ ID NO:2, or amino acid 1 through about amino acid 

20 438 ofSEQIDNO:4. 

In other embodiments, the soluble NgR polypeptides are fusion proteins that 
contain amino acids 30 through about amino acid 395 of mature NgR2 or amino acid 
40 through about amino acid 438 of NgR3, the C-terminal 10 amino acids of a human 
IgG 1 hinge region containing the two cysteine residues thought to participate in 

25 interchain disulfide bonding, and the CH2 and CH3 regions of a human IgGI heavy 
chain constant domain. This type of recombinant protein is designed to modulate 
inhibition of axonal elongation through inhibition of theNogo ligand binding to NgRl, 
or by inhibiting the ligand of the NgiR from interacting with cell surface NgR. The 
NgR portion of the fusion binds to the Nogo ligand and the IgGI portion binds to the 

30 FcyRI (macrophage) and FcyIQ (NK cells and neutrophils) receptors. 

The production of the soluble polypeptides useful in this invention may be 
achieved by a variety of methods known in the art. For example, the polypeptides may 
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be derived from intact transmembrane NgR molecules by proteolysis using specific 
endopeptidases in combination with exopeptidases, Edman degradation, or both. The 
intact NgR molecule, in turn, may be purified from its natural source using 
conventional methods. Alternatively, the intact NgR may be produced by known 
5 recombinant DNA techniques using cDNAs, expression vectors and well-known 
techniques for recombinant gene expression. 

Preferably, the soluble polypeptides useful in the present invention are 
produced directly, thus eliminating the need for an entire NgR as a starting material. 
This may be achieved by conventional chemical synthesis techniques or by well-known 

1 0 recombinant DNA techniques wherein only those DNA sequences which encode the 
desired peptides are expressed in transformed hosts. For example, a gene which 
encodes the desired soluble NgR polypeptide may be synthesized by chemical means 
using an oligonucleotide synthesizer. Such oligonucleotides are designed based on the 
amino acid sequence of the desired soluble NgR polypeptide. Specific DNA sequences 

15 coding for the desired peptide also can be derived from the full- length DNA sequence 
by isolation of specific restriction endonuclease fragments or by PCR synthesis of the 
specified region from cDNA. 

A nucleic acid molecule of the present invention, e.g., a nucleic acid molecule 
having the nucleotide sequence of SEQ ID NOs: 1, 3 or a complement of either of 

20 these nucleotide sequences, can be isolated using standard molecular biology 

techniques and the sequence information provided herein. Using all or a portion of the 
nucleic acid sequences of SEQ ID NOs:l or 3 as a hybridization probe, NgR nucleic 
acid sequences can be isolated using standard hybridization and cloning techniques 
(e.g., as described in Sambrook et aL, eds., MOLECULAR CLONING: A LABORATORY 

25 MANUAL 2 nd Ed., Cold Spring Harbor Laboratory Press, Cold Spring Harbor, NY, 
1989; and Ausubel, etal, eds., Current Protocols in Molecular Biology, John 
Wiley & Sons, New York, NY, 1993). 

A nucleic acid of the invention can be amplified using cDNA, mRNA or 
alternatively, genomic DNA, as a template and appropriate oligonucleotide primers 

30 according to standard PCR amplification techniques. The nucleic acid so amplified can 
be cloned into an appropriate vector and characterized by DNA sequence analysis. 
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Furthermore, oligonucleotides corresponding to NgR nucleotide sequences can be 
prepared by standard synthetic techniques, e.g., using an automated DNA synthesizer. 

As used herein, the terms "modulates" or "modifies" means an increase or 
decrease in the amount, quality, or effect of a particular activity or protein. 
5 As used herein, the term "oligonucleotide" refers to a series of linked 

nucleotide residues which has a sufficient number of bases to be used in a polymerase 
chain reaction (PCR). This short sequence is based on (or designed from) a genomic 
or cDNA sequence and is used to amplify, confirm or reveal the presence of an 
identical, similar or complementary DNA or feNA in a particular cell or tissue. 

10 Oligonucleotides comprise portions of a DNA sequence having at least about 10 
nucleotides and as many as about 50 nucleotides, preferably about 15 to 30 
nucleotides. They are chemically synthesized and may be used as probes. 

As used herein, the term "probe" refers to nucleic acid sequences of variable 
length, preferably between at least about 10 and as many as about 6,000 nucleotides, 

15 depending on use. They are used in the detection of identical, similar or 

complementary nucleic acid sequences. Longer length probes are usually obtained from 
a natural or recombinant source, are highly specific and much slower to hybridize than 
oligomers. They may be single- or double-stranded and carefully designed to have 
specificity in PCR, hybridization membrane-based, or ELIS A-like technologies. 

20 The term "preventing" refers to decreasing the probability that an organism 

contracts or develops an abnormal condition. 

The term "treating" refers to having a therapeutic effect and at least partially 
alleviating or abrogating an abnormal condition in the organism. 

The term "therapeutic effect" refers to the inhibition or activation factors 

25 causing or contributing to the abnormal condition. A therapeutic effect relieves to 

some extent one or more of the symptoms of the abnormal condition. In reference to 
the treatment of abnormal conditions, a therapeutic effect can refer to one or more of 
the following: (a) an increase in the proliferation, growth, and/or differentiation of 
cells; (b) inhibition (i.e., slowing or stopping) of cell death; (c) inhibition of 

30 degeneration; (d) relieving to some extent one or more of the symptoms associated 
with the abnormal condition; and (e) enhancing the function of the affected population 
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of cells. Compounds demonstrating efficacy against abnormal conditions can be 
identified as described herein. 

The term "abnormal condition" refers to a function in the cells or tissues of an 
organism that deviates from their normal functions in that organism. An abnormal 
5 condition can relate to cell proliferation, cell differentiation, cell signaling, or cell 

survival. An abnormal condition may also include obesity, diabetic complications such 
as retinal degeneration, and irregularities in glucose uptake and metabolism, and fatty 
acid uptake and metabolism. 

Abnormal cell proliferative conditions, for example, include cancers such as 
10 fibrotic and mesangial disorders, abnormal angiogenesis and vasculogenesis, wound 
healing, psoriasis, diabetes mellitus and inflammation. 

Abnormal differentiation conditions include, for example, neurodegenerative 
disorders, slow wound healing rates and slow tissue grafting healing rates. 

Abnormal cell signaling conditions include, for example, psychiatric disorders 
15 involving excess neurotransmitter activity. 

Abnormal cell survival conditions may also relate to conditions in which 
programmed cell death (apoptosis) pathways are activated or abrogated. A number of 
protein kinases are associated with the apoptosis pathways. Aberrations in the 
function of any one of the protein kinases could lead to cell immortality or premature 
20 cell death. 

The term "administering" relates to a method of incorporating a compound into 
cells or tissues of an organism. The abnormal condition can be prevented or treated 
when the cells or tissues of the organism exist within the organism or outside of the 
organism. Cells existing outside the organism can be maintained or grown in cell 

25 culture dishes. For cells harbored within the organism, many techniques exist in the art 
to administer compounds, including (but not limited to) oral, parenteral, dermal, 
injection, and aerosol applications. For cells outside of the organism, multiple 
techniques exist in the art to administer the compounds, including (but not limited to) 
cell microinjection techniques, transformation techniques and carrier techniques. 

30 The abnormal condition can also be prevented or treated by administering a 

compound to a group of cells having an aberration in a signal transduction pathway to 
an organism. The effect of administering a compound on organism function can then 
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be monitored. The organism is preferably a mouse, rat, rabbit, guinea pig or goat, 
more preferably a monkey or ape, and most preferably a human. 

By "amplification" it is meant increased numbers of DNA or RNA in a cell 
compared with normal cells. "Amplification" as it refers to RNA can be the detectable 
5 presence of RNA in cells, since in some normal cells there is no basal expression of 
RNA In other normal cells, a basal level of expression exists, therefore in these cases 
amplification is the detection of at least 1-2-fold, and preferably more, compared to 
the basal level. 

The amino acid sequences are presented in the amino to carboxy direction, 
10 from left to right. The amino and carboxy groups are not presented in the sequence. 
The nucleotide sequences are presented by single strand only, in the 5* to 3' direction, 
from left to right. Nucleotides and amino acids are represented in the manner 
recommended by the IUPAOIUB Biochemical Nomenclature Commission or (for 
amino acids) by three letters code. 

15 

Nucleic Acids 

Genomic DNA of the invention comprises the protein-coding region for a 
polypeptide of the invention and is also intended to include allelic variants thereof. It is 
widely understood that, for many genes, genomic DNA is transcribed into RNA 

20 transcripts that undergo one or more splicing events wherein intron (i.e., non-coding 
regions) of the transcripts are removed, or "spliced out." RNA transcripts that can be 
spliced by alternative mechanisms, and therefore be subject to removal of different 
RNA sequences but still encode aNgR polypeptide, are referred to in the art as splice 
variants which are embraced by the invention. Splice variants comprehended by the 

25 invention therefore are encoded by the same original genomic DNA sequences but 
arise from distinct mRNA transcripts. Allelic variants are modified forms of a 
wild-type gene sequence, the modification resulting from recombination during 
chromosomal segregation or exposure to conditions which give rise to genetic 
mutation. Allelic variants, like wild-type genes, are naturally occurring sequences (as 

30 opposed to non-naturally occurring variants arising from in vitro manipulation). 

The invention also comprehends cDNA that is obtained through reverse 
transcription of an RNA polynucleotide encoding NgR (conventionally followed by 
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second-strand synthesis of a complementary strand to provide a double-stranded 
DNA). 

Preferred DNA sequences encoding a human NgR polypeptide is set out in 
SEQ ID NOs: 1 and 13. A preferred DNA of the invention comprises a double 
5 stranded molecule comprising the coding molecule (i.e., the "coding strand") along 
with the complementary molecule (the "non-coding strand" or "complement") having a 
sequence unambiguously deducible from the coding strand according to Watson-Crick 
base-pairing rules for DNA. Also preferred are other polynucleotides encoding NgR 
polypeptides, as shown in SEQ ID NO:3, which comprises murine NgR homolog, 
10 NgR3. 

Also preferred are nucleotide sequences that encode at least a portion of a NgR 
polypeptide that has at least one biological function of a NgR. More preferred are 
nucleotide sequences that encode a portion of NgR that encodes at least the mature 
NgR without the hydrophobic C-terminal GPI signal. Also preferred are nucleotide 
15 sequences that encode the portion of NgR that encodes at least the ligand-binding 
region of NgR. 

The invention further embraces other species, preferably mammalian, homologs 
of the human NgR DNA. Species homologs, sometimes referred to as "orthologs," in 
general, share at least 35%, at least 40%, at least 45%, at least 50%, at least 60%, at 

20 least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 
95%, at least 98%, or at least 99% homology with human DNA of the invention. 
Generally, percent sequence "homology" with respect to polynucleotides of the 
invention may be calculated as the percentage of nucleotide bases in the candidate 
sequence that are identical to nucleotides in the NgR sequences set forth in SEQ ID 

25 NOs:l, 3 or 13, after aligning the sequences and introducing gaps, if necessary, to 
achieve the maximum percent sequence identity. 

The polynucleotide sequence information provided by the invention makes 
possible large-scale expression of the encoded polypeptide by techniques well known 
and routinely practiced in the art. Polynucleotides of the invention also permit 

30 identification and isolation of polynucleotides encoding related NgR polypeptides, such 
as human allelic variants and species homologs, by well-known techniques including 
Southern and/or Northern hybridization, and polymerase chain reaction (PCR). 
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Examples of related polynucleotides include human and non-human genomic 
sequences, including allelic variants, as well as polynucleotides encoding polypeptides 
homologous to NgR and structurally related polypeptides sharing one or more 
biological, immunological, and/or physical properties of NgR. Non-human species 
5 genes encoding proteins homologous to NgR can also be identified by Southern and/or 
PCR analysis and are useful in animal models for NgR disorders. Knowledge of the 
sequence of a human NgR DNA also makes possible through use of Southern 
hybridization or polymerase chain reaction (PCR) the identification of genomic DNA 
sequences encoding NgR expression control regulatory sequences such as promoters, 

10 operators, enhancers, repressors, and the like. Polynucleotides of the invention are 
also useful in hybridization assays to detect the capacity of cells to express NgR. 
Polynucleotides of the invention may also provide a basis for diagnostic methods useful 
for identifying a genetic alteration(s) in a NgR locus that underlies a disease state or 
states, which information is useful both for diagnosis and for selection of therapeutic 

15 strategies. 

The disclosure herein of a fiill-length polynucleotide encoding a NgR 
polypeptide makes readily available to the worker of ordinary skill in the art every 
possible fragment of the fiill-length polynucleotide. The invention, therefore, provides 
fragments of NgR-encoding polynucleotides comprising at least 6, and preferably at 

20 least 14, 16, 18, 20, 25, 50, or 75 consecutive nucleotides of a polynucleotide 

encoding NgR. Preferably, fragments of polynucleotides of the invention comprise 
sequences unique to the NgR-encoding polynucleotide sequence, and therefore 
hybridize under highly stringent or moderately stringent conditions only {i.e., 
"specifically") to polynucleotides encoding NgR (or fragments thereof). 

25 Polynucleotide fragments of genomic sequences of the invention comprise not only 
sequences unique to the coding region, but also include fragments of the full-length 
sequence derived from introns, regulatory regions, and/or other non-translated 
sequences. Sequences unique to polynucleotides of the invention are recognizable 
through sequence comparison to other known polynucleotides, and can be identified 

30 through use of alignment programs routinely utilized in the art, e.g., those made 
available in public sequence databases. Such sequences also are recognizable from 
Southern hybridization analyses to determine the number of fragments of genomic 
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DNA to which a polynucleotide will hybridize. Polynucleotides of the invention can be 
labeled in a manner that permits their detection, including radioactive, fluorescent and 
enzymatic labeling. 

Fragments of polynucleotides are particularly useful as probes for detection of 
5 full-length or fragment of NgR polynucleotides. One or more polynucleotides can be 
included in kits that are used to detect the presence of a polynucleotide encoding NgR, 
or used to detect variations in a polynucleotide sequence encoding NgR. 

The invention also embraces DNAs encoding NgR polypeptides that hybridize 
under moderately stringent or high stringency conditions to the noncoding strand, or 

10 complement, of the polynucleotide in any of SEQ ID NOs: 1 or 3 . 

Stringent conditions are known to those skilled in the art and can be found in 
Current Protocols in Molecular Biology, John Wiley & Sons, N.Y. (1989), 
6.3. 176.3.6. Preferably, the conditions are such that sequences at least about 65%, 
70%, 75%, 85%, 90%, 95%, 98% or 99% homologous to each other typically remain 

1 5 hybridized to each other. A non-limiting example of stringent hybridization conditions 
is hybridization in a high salt buffer comprising 6X SSC, 50 mM Tris-HCl (pH 7.5), 1 
mM EDTA, 0.02% PVP, 0.02% Ficoll, 0.02% BSA and 500 mg/ml denatured salmon 
sperm DNA at 65 B C. This hybridization is followed by one or more washes in 0.2X 
SSC, 0.01% BSA at 50 B C. An isolated nucleic acid molecule of the invention that 

20 hybridizes under stringent conditions to the sequence of SEQ ID NOs: 1 or 3 
corresponds to a naturally occurring nucleic acid molecule. As used herein, a 
"naturally-occurring" nucleic acid molecule refers to an RNA or DNA molecule having 
a nucleotide sequence that occurs in nature (e.g., encodes a natural protein). 
As used herein, "stringent hybridization conditions" means: 42 8 C in a hybridization 

25 solution comprising 50% formamide, 1% SDS, 1 M NaCl, 10% (wt/vol) dextran 

sulfate, and washing twice for 30 minutes at 60°C in a wash solution comprising 0. 1 X 
SSC and 1% SDS. 

Vectors 

30 Another aspect of the present invention is directed to vectors, or recombinant 

expression vectors, comprising any of the nucleic acid molecules described above. 
Vectors are used herein either to amplify DNA or RNA encoding NgR and/or to 
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express DNA which encodes NgR. As used herein, the term "vector" refers to a 
nucleic acid molecule capable of transporting another nucleic acid to which it has been 
linked. One type of vector is a "plasmid", which refers to a circular double stranded 
DNA loop into which additional DNA segments can be ligated. Another type of 
5 vector is a viral vector, wherein additional DNA segments can be ligated into the viral 
genome. Certain vectors are capable of autonomous replication in a host cell into 
which they are introduced (e.g., bacterial vectors having a bacterial origin of 
replication and episomal mammalian vectors). Other vectors (e.g., non-episomal 
mammalian vectors) are integrated into the genome of a host cell upon introduction 

10 into the host cell, and thereby are replicated along with the host genome. Moreover, 
certain vectors are capable of directing the expression of genes to which they are 
operatively linked. Such vectors are referred to herein as "expression vectors". In 
general, expression vectors of utility in recombinant DNA techniques are often in the 
form of plasmids. In the present specification, "plasmid" and "vector" can be used 

15 interchangeably as the plasmid is the most commonly used form of vector. However, 
the invention is intended to include such other forms of expression vectors, such as 
viral vectors (e.g., replication defective retroviruses, adenoviruses and adeno- 
associated viruses), that serve equivalent functions. 

Expression of proteins in prokaryotes is most often carried out mE. coli with 

20 vectors containing constitutive or inducible promoters directing the expression of 

either fusion or non-fusion proteins. Fusion vectors add a number of amino acids to a 
protein encoded therein, usually to the amino terminus of the recombinant protein. 
Such fusion vectors typically serve three purposes: (1) to increase expression of 
recombinant protein; (2) to increase the solubility of the recombinant protein; and (3) 

25 to aid in the purification of the recombinant protein by acting as a ligand in affinity 
purification. Often, in fusion expression vectors, a proteolytic cleavage site is 
introduced at the junction of the fusion moiety and the recombinant protein to enable 
separation of the recombinant protein from the fusion moiety subsequent to 
purification of the fusion protein. Such enzymes, and their cognate recognition 

30 sequences, include Factor Xa, thrombin and enterokinase. Typical fusion expression 
vectors include pGEX (Pharmacia Biotech Inc; Smith and Johnson (1988) Gene 67, 
31-40), pMAL (New England Biolabs, Beverly, Mass.) and pRIT5 (Pharmacia, 
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Piscataway, N. J.) that fuse glutathione-S-transferase (GST), maltose E binding protein, 
or protein A, respectively, to the target recombinant protein. 

Examples of suitable inducible non-fusion E. coli expression vectors include 
pTrc (Amranne/a/., (1988) Gene 69, 301-315) and pET lid (Studier etal, GENE 
5 Expression Technology: Methods in Enzymology 1 85, Academic Press, San 
Diego, CA. (1990) 60-89). 

One strategy to maximize recombinant protein expression in E. coli is to 
express the protein in host bacteria with an impaired capacity to proteolytically cleave 
the recombinant protein. See, Gottesman, GENE EXPRESSION TECHNOLOGY: 

10 METHODS IN ENZYMOLOGY 185, Academic Press, San Diego, CA. (1990) 119-128. 
Another strategy is to alter the nucleic acid sequence of the nucleic acid to be inserted 
into an expression vector so that the individual codons for each amino acid are those 
preferentially utilized in E coli (Wadae/a/., (1992) Nucleic Acids Res, 20, 2111- 
2118). Such alteration of nucleic acid sequences of the invention can be carried out by 

1 5 standard DNA synthesis techniques. 

In another embodiment, the NgR expression vector is a yeast expression 
vector. Examples of vectors for expression in yeast S. cerevisiae include pYepSecl 
(Baldari, et al, (1987) EMBO 1 6, 229-234), pMFa (Kurjan and Herskowitz (1982) 
Cell 30, 933-943), pJRY88 (Schultzef a/., (1987) Gene 54, 113-123), pYES2 

20 (Invitrogen Corporation, San Diego, CA), and picZ (InVitrogen Corp, San Diego, 
CA). 

Alternatively, NgR can be expressed in insect cells using baculovirus expression 
vectors. Baculovirus vectors available for expression of proteins in cultured insect 
cells (e.g., SF9 cells) include the pAc series (Smith etal, (1983) MoL Cell. Biol. 3, 

25 21 56-2 1 65) and the pVL series (Lucklow and Summers (1 989) Virology 1 70, 3 1 -3 9). 
In yet another embodiment, a nucleic acid of the invention is expressed in 
mammalian cells using a mammalian expression vector. Examples of mammalian 
expression vectors include pCDM8 (Seed (1987) Nature 329, 840) and pMT2PC 
(Kaufinanera/. (\9Zl)EMBOX 6, 187-195). When used in mammalian cells, the 

30 expression vector's control functions are often provided by viral regulatory elements. 
For example, commonly used promoters are derived from polyoma, adenovirus 2, 
cytomegalovirus and Simian Virus 40. For other suitable expression systems for both 
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prokaiyotic and eukaryotic cells. See, e.g., Chapters 16 and 17 of Sambrook et al 9 
(Eds.) Molecular Cloning: A Laboratory Manual. 2 nd Ed., Cold Spring Harbor 
Laboratory, Cold Spring Harbor Laboratory Press, Cold Spring Harbor, NY, 1989. 
In another embodiment, the recombinant, mammalian expression vector is 
5 capable of directing expression of the nucleic acid preferentially in a particular cell type 
(e.g., tissue-specific regulatory elements are used to express the nucleic acid). Tissue- 
specific regulatory elements are known in the art. Non-limiting examples of suitable 
tissue-specific promoters include the albumin promoter (liver-specific; Pinkert et al. 
(1987) Genes Dev. 1, 268-277), lymphoid-specific promoters (Calame and Eaton 

10 (1988) Adv. Immunol. 43, 235-275), in particular promoters of T cell receptors 

(Winoto and Baltimore (1989) EMBOJ. 8, 729-733) and immunoglobulins (Banerji et 
al. (1983) Cell 33, 729-740; Queen and Baltimore (1983) Cell 33, 741-748), neuron- 
specific promoters (e.g., the neurofilament promoter; Byrne and Ruddle (1989) Proc. 
Natl. Acad. Sci. USA 86, 5473-5477), pancreas-specific promoters (Edlund et al. 

15 (1985) Science 230, 912-916), and mammary gland-specific promoters (e.g., milk 
whey promoter; U.S. Pat. No. 4,873,316 and European Application Publication No. 
264,166). Developmentally-regulated promoters are also encompassed, e.g., the 
murine hox promoters (Kessel and Gruss (1990) Science 249, 374-379) and the 
cc-fetoprotein promoter (Campes and Tilghman (1989) Genes Dev. 3, 537-546). 

20 The invention further provides a recombinant expression vector comprising a 

DNA molecule of the invention cloned into the expression vector in an antisense 
orientation. That is, the DNA molecule is operatively linked to a regulatory sequence 
in a manner that allows for expression (by transcription of the DNA molecule) of an 
RNA molecule that is antisense NgR mRNA Regulatory sequences operatively linked 

25 to a nucleic acid cloned in the antisense orientation can be chosen that direct the 
continuous expression of the antisense RNA molecule in a variety of cell types, for 
instance viral promoters and/or enhancers, or regulatory sequences can be chosen that 
direct constitutive, tissue-specific or cell-type-specific expression of antisense RNA, 
The antisense expression vector can be in the form of a recombinant plasmid, 

30 phagemid or attenuated virus in which antisense nucleic acids are produced under the 
control of a high efficiency regulatory region, the activity of which can be determined 
by the cell type into which the vector is introduced. For a discussion of the regulation 
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of gene expression using antisense genes see Weintraub et al, Antisense UNA as a 
molecular tool for genetic analysis, REVIEWS— TRENDS IN GENETICS, Vol. 1(1) 1986. 

Preferred vectors include, but are not limited to, plasmids, phages, cosmids, 
episomes, viral particles or viruses and integratable DNA fragments (i.e., fragments 
5 integratable into the host genome by homologous recombination). Preferred viral 
particles include, but are not limited to, adenoviruses, baculoviruses, parvoviruses, 
herpesviruses, poxviruses, adeno-associated viruses, Semliki Forest viruses, vaccinia 
viruses and retroviruses. Preferred expression vectors include, but are not limited to, 
pcDNA3 (Invitrogen) and pSVL (Pharmacia Biotech). Other expression vectors 

10 include, but are not limited to, pSPORT™ vectors, pGEM™ vectors (Promega), 

pPROEXvectors™ (LTI, Bethesda, MD), Bluescript™ vectors (Stratagene), pQE™ 
vectors (Qiagen), pSE420™ (Invitrogen) and pYES2™(Invitrogen). 

Preferred expression vectors are replicable DNA constructs in which a DNA 
sequence encoding NgR is operably linked or connected to suitable control sequences 

1 5 capable of effecting the expression of the NgR in a suitable host. DNA regions are 
operably linked or connected when they are functionally related to each other. For 
example, a promoter is operably linked or connected to a coding sequence if it controls 
the transcription of the sequence. Amplification vectors do not require expression 
control domains, but rather need only the ability to replicate in a host, usually 

20 conferred by an origin of replication, and a selection gene to facilitate recognition of 
transformants. The need for control sequences in the expression vector will vary 
depending upon the host selected and the transformation method chosen. Generally, 
control sequences include, but are not limited to a transcriptional promoter, enhancers, 
an optional operator sequence to control transcription, polyadenylation signals, a 

25 sequence encoding suitable mRNA ribosomal binding and sequences which control the 
termination of transcription and translation. Such regulatory sequences are described, 
for example, in Goeddel, GENE EXPRESSION TECHNOLOGY: METHODS IN 
ENZYMOLOGY 185, Academic Press, San Diego, CA (1990). Regulatory sequences 
include those that direct constitutive expression of a nucleotide sequence in many types 

30 of host cell and those that direct expression of the nucleotide sequence only in certain 
host cells (e.g., tissue-specific regulatory sequences). It will be appreciated by those 
skilled in the art that the design of the expression vector can depend on such factors as 
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the choice of the host cell to be transformed, the level of expression of protein desired, 
etc. The expression vectors of the invention can be introduced into host cells to 
thereby produce proteins or peptides, including fusion proteins or peptides, encoded by 
nucleic acids as described herein (e.g., NgR proteins, mutant forms of NgR, fusion 
5 proteins, etc.). 

Preferred vectors preferably contain a promoter that is recognized by the host 
organism. The promoter sequences of the present invention may be prokaryotic, 
eukaryotic or viral. Examples of suitable prokaryotic sequences include the PR and PL 
promoters of bacteriophage lambda (The Bacteriophage Lambda, Hershey, A.D. 

10 (Ed.), Cold Spring Harbor Laboratory Press, Cold Spring Harbor, NY (1973), which is 
incorporated herein by reference in its entirety; LAMBDA II, Hendrix, R.W. (Ed.), Cold 
Spring Harbor Laboratory Press, Cold Spring Harbor, NY (1980), which is 
incorporated herein by reference in its entirety); the trp, recA, heat shock, and lacZ 
promoters of E. coli and the SV40 early promoter (Benoist et al t (1981) Nature 290, 

1 5 3 04-3 1 0, which is incorporated herein by reference in its entirety) . Additional 

promoters include, but are not limited to, mouse mammary tumor virus, long terminal 
repeat of human immunodeficiency virus, maloney virus, cytomegalovirus immediate 
early promoter, Epstein Barr virus, Rous sarcoma virus, human actin, human myosin, 
human hemoglobin, human muscle creatine and human metallothionein. 

20 Additional regulatory sequences can also be included in preferred vectors. 

Preferred examples of suitable regulatory sequences are represented by the 
Shine-Dalgarno sequence of the replicase gene of the phage MS-2 and of the gene ell 
of bacteriophage lambda. The Shine-Dalgarno sequence may be directly followed by 
DNA encoding NgR and result in the expression of the mature NgR protein. 

25 Moreover, suitable expression vectors can include an appropriate marker that 

allows the screening of the transformed host cells. The transformation of the selected 
host is carried out using any one of the various techniques well known to the expert in 
the art and described in Sambrook et aL t supra. 

An origin of replication can also be provided either by construction of the 

30 vector to include an exogenous origin or may be provided by the host cell 

chromosomal replication mechanism. If the vector is integrated into the host cell 
chromosome, the latter may be sufficient. Alternatively, rather than using vectors 
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which contain viral origins of replication, one skilled in the art can transform 
mammalian cells by the method of co-transformation with a selectable marker and NgR 
DNA. An example of a suitable marker is dihydrofolate reductase (DHFR) or 
thymidine kinase (see, U.S. Patent No. 4,399,216). 
5 Nucleotide sequences encoding NgR may be recombined with vector DNA in 

accordance with conventional techniques, including blunt-ended or staggered-ended 
termini for ligation, restriction enzyme digestion to provide appropriate termini, .filling 
in of cohesive ends as appropriate, alkaline phosphatase treatment to avoid undesirable 
joining and ligation with appropriate ligases. Techniques for such manipulation are 
10 disclosed by Sambrook et al , supra and are well known in the art. Methods for 

construction of mammalian expression vectors are disclosed in, for example, Okayama 
etal, (1983)M>/. Cell Biol 3:280, Cosmmetal (1986) M?/. Immunol 23:935, 
Cosmmetal, (\9U)Nature 312:768, EP-A-0367566, and WO 91/18982, each of 
which is incorporated herein by reference in its entirety. 

15 

Host Cells and Transformed Host Cells 

According to another aspect of the invention, host cells are provided, including 
prokaryotic and eukaryotic cells, comprising a polynucleotide of the invention (or 
vector of the invention) in a manner that permits expression of the encoded NgR 

20 polypeptide. Preferably, the cell produces little or no endogenous NgR polypeptide. 
Polynucleotides of the invention may be introduced into the host cell as part of a 
circular plasmid, or as linear DNA comprising an isolated protein coding region or a 
viral vector. Methods for introducing DNA into the host cell that are well known and 
routinely practiced in the art include transformation, transfection, electroporation, 

25 nuclear injection, or fusion with carriers such as liposomes, micelles, ghost cells and 
protoplasts. Expression systems of the invention include bacterial, yeast, fungal, plant, 
insect, invertebrate, vertebrate and mammalian cells systems. 

Host cells of the invention are a valuable source of immunogen for 
development of antibodies specifically immunoreactive with NgR. Host cells of the 

30 invention are also useful in methods for the large-scale production of NgR 

polypeptides wherein the cells are grown in a suitable culture medium and the desired 
polypeptide products are isolated from the cells, or from the medium in which the cells 
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are grown, by purification methods known in the art, e.g., conventional 
chromatographic methods including immunoaflBnity chromatography, receptor affinity 
chromatography, hydrophobic interaction chromatography, lectin affinity 
chromatography, size exclusion filtration, cation or anion exchange chromatography, 
5 high pressure liquid chromatography (HPLC), reverse phase HPLC, and the like. Still 
other methods of purification include those methods wherein the desired protein is 
expressed and purified as a fusion protein having a specific tag, label or chelating 
moiety that is recognized by a specific binding partner or agent. The purified protein 
can be cleaved to yield the desired protein, or can be left as an intact fusion protein. 

10 Cleavage of the fusion component may produce a form of the desired protein having 
additional amino acid residues as a result of the cleavage process. 

Knowledge of NgR DNA sequences allows for modification of cells to permit, 
or increase, expression of endogenous NgR. Cells can be modified (e.g., by 
homologous recombination) to provide increased expression by replacing, in whole or 

15 in part, the naturally occurring NgR promoter with all or part of a heterologous 

promoter so that the cells express NgR at higher levels. The heterologous promoter is 
inserted in such a manner that it is operatively linked to endogenous NgR encoding 
sequences. (See, for example, PCT International Publication No. WO 94/12650, PCT 
International Publication No. WO 92/20808, and PCT International Publication No. 

20 WO 91/09955.) It is also contemplated that, in addition to heterologous promoter 
DNA, amplifiable marker DNA (e.g., ada, dhfr, and the multifunctional CAD gene 
which encodes carbamoyl phosphate synthase, aspartate transcarbamylase, and 
dihydroorotase) and/or intron DNA may be inserted along with the heterologous 
promoter DNA. If linked to the NgR coding sequence, amplification of the marker 

25 DNA by standard selection methods results in co-amplification of the NgR coding 
sequences in the cells. 

The DNA sequence information provided by the present invention also makes 
possible the development (e.g., by homologous recombination or "knock-out" 
strategies; see Capecchi, Science 244:1288-1292 (1989)) of animals that fail to express 

30 functional NgR or that express a variant of NgR. Such animals (especially small 

laboratory animals such as rats, rabbits and mice) are useful as models for studying the 
in vivo activities of NgR and modulators of NgR. 
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Suitable host cells for expression of the polypeptides of the invention include, 
but are not limited to, prokaryotes, yeast, and eukaryotes. If a prokaryotic expression 
vector is employed, then the appropriate host cell would be any prokaryotic cell 
capable of expressing the cloned sequences. Suitable prokaryotic cells include, but are 
5 not limited to, bacteria of the genera Escherichia, Bacillus, Salmonella, Pseudomonas, 
Streptomyces and Staphylococcus. 

If a eukaryotic expression vector is employed, then the appropriate host cell 
would be any eukaryotic cell capable of expressing the cloned sequence. Preferably, 
eukaryotic cells are cells of higher eukaryotes. Suitable eukaryotic cells include, but 

10 are not limited to, non-human mammalian tissue culture cells and human tissue culture 
cells. Preferred host cells include, but are not limited to, insect cells, HeLa cells, 
Chinese hamster ovary cells (CHO cells), African green monkey kidney cells (COS 
cells), human 293 cells, and murine 3T3 fibroblasts. Propagation of such cells in cell 
culture has become a routine procedure (see, Tissue Culture, Academic Press, Kruse 

15 and Patterson, Eds. (1973), which is incorporated herein by reference in its entirety). 

In addition, a yeast cell may be employed as a host cell. Preferred yeast cells 
include, but are not limited to, the genera Saccharomyces, Pichia andKluveromyces. 
Preferred yeast hosts are S. cerevisiae and P. pastoris. Preferred yeast vectors can 
contain an origin of replication sequence from a 2T yeast plasmid, an autonomously 

20 replication sequence (ARS), a promoter region, sequences for polyadenylation, 

sequences for transcription termination and a selectable marker gene. Shuttle vectors 
for replication in both yeast and K coli are also included herein. 

Alternatively, insect cells may be used as host cells. In a preferred 
embodiment, the polypeptides of the invention are expressed using a baculovirus 

25 expression system (see, Luckow et al y Bio/Technology, 1988, 6, 47; Baculovirus 
Expression Vectors: A Laboratory Manual, OTUelly et al (Eds.), W.H. 
Freeman and Company, New York, 1992; and U.S. Patent No. 4,879,236, each of 
which is incorporated herein by reference in its entirety). In addition, the MAXBAC™ 
complete baculovirus expression system (Invitrogen) can, for example, be used for 

30 production in insect cells. 

Suitable host cells are discussed further in Goeddel, GENE EXPRESSION 
Technology: Methods in Enzymology 185, Academic Press, San Diego, CA 
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(1990). Alternatively, the recombinant expression vector can be transcribed and 
translated in vitro, for example using T7 promoter regulatory sequences and T7 
polymerase. 

Vector DNA can be introduced into prokaryotic or eukaryotic cells via 
5 conventional transformation or transfection techniques. As used herein, the terms 

"transformation" and "transfection" are intended to refer to a variety of art-recognized 
techniques for introducing foreign nucleic acid (e.g., DNA) into a host cell, including 
calcium phosphate or calcium chloride co-precipitation, DEAE-dextran-mediated 
transfection, lipofection, or electroporation. Suitable methods for transforming or 
1 0 transfecting host cells can be found in Sambrook, et al (MOLECULAR CLONING: A 

Laboratory Manual. 2nd ed., Cold Spring Harbor Laboratory, Cold Spring Harbor 
Laboratory Press, Cold Spring Harbor, N.Y., 1989), and other laboratory manuals. 

For stable transfection of mammalian cells, it is known that, depending upon 
the expression vector and transfection technique used, only a small fraction of cells 
1 5 may integrate the foreign DNA into their genome. In order to identify and select these 
integrants, a gene that encodes a selectable marker (e.g., resistance to antibiotics) is 
generally introduced into the host cells along with the gene of interest. Various 
selectable markers include those that confer resistance to drugs, such as G418, 
hygromycin, dihydrofolate reductase (DHFR) and methotrexate. Nucleic acid 
20 encoding a selectable marker can be introduced into a host cell on the same vector as 
that encoding NgR or can be introduced on a separate vector. Cells stably transfected 
with the introduced nucleic acid can be identified by drug selection (e.g., cells that 
have incorporated the selectable marker gene will survive, while the other cells die). 
In a preferred embodiment, the polypeptides of the invention, including forms 
25 ofNgR2 and NgR3, soluble forms of NgR, chimeric NgR polypeptides, Ng^/Ig 
fusions and fragments and variations of each of the above are expressed in Chinese 
Hamster Ovary (CHO) cells. 

In order to introduce the DNA fragment coding for the NgR protein or 
polypeptide into the CHO cell to express the recombinant NgR protein or polypeptide, 
30 it is necessary to construct the expression vector. 

The vectors for CHO expression include, but are not limited to, pAl-1 1, pXTl, 
pRc/CMV, pRc/RSV and pcDNAINeo. The promoter is not specifically limited 
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provided it effectively promotes expression in CHO cells. Examples of suitable 
promoters are; SRa, SV40, LTR, CMV, and HSV-TK. Of these, CMV and Sra 
promoters are preferred. 

In addition to the above-mentioned promoters, the expression vectors may 
5 contain enhancers, splicing signals, polyadenylation signals, selectable markers and an 
S V40 replication origin. Suitable selectable markers include, but are not limited to the 
dihydrofolate reductase (DHFR) gene which provides resistance to methotrexate 
(MTX), the ampicillin resistance gene, and the neomycin resistance gene. 

Examples of the expression vectors each containing the DNA coding for NgR, 

10 portions, fragments and soluble constructs thereof, include the vector (such as one 
described above), into which the promoter is operably linked (preferably upstream) to 
the nucleotide sequence encoding the desired NgR construct; a polyadenylation signal 
downstream from the nucleotide sequence encoding the NgR construct; and, 
preferably, the vector includes an operable DHFR gene. Preferably, the ampicillin 

15 resistant gene is also operably contained in the vector. 

CHO cell lacking the DHFR gene (Urlaub, G. et al, (1980) Proc. Natl Acad. 
Set USA 77, 4216-4220) and CHO-K1 (Proc. Natl Acad Set USA 60, 1275 (1968)) 
are suitable for use. 

The NgR expression vectors prepared as above are introduced into CHO cells 

20 by any known method, including, but not limited to the calcium phosphate method 

(Graham and van der Eb (1973) Virol 52, 456-467) and electroporation (Nuemann et 
al, (1982) EMBOJ. 1, 841-845). 

Transformants carrying the expression vectors are selected based on the 
above-mentioned selectable markers. Repeated clonal selection of the transformants 

25 using the selectable markers allows selection of stable cell lines having high expression 
of the NgR constructs. Increased MTX concentrations in the selection medium allows 
gene amplification and greater expression of the desired protein. The CHO cell 
containing the recombinant NgR can be produced by cultivating the CHO cells 
containing the NR expression vectors constitutively expressing the NgR constructs. 

30 Media used in cultivating CHO cells includes DMEM medium supplemented 

with about 0.5 to 20% fetal calf serum, DMEM medium and RPMI1640 medium. The 
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pH of the medium is preferably about 6 to 8. Cultivation is preferably at about 30 to 
40 9 C for about 15 to 72 hours with aeration. 

A host cell of the invention, such as a prokaryotic or eukaryotic host cell in 
culture, can be used to produce (i.e., express) NgR protein. Accordingly, the 
5 invention further provides methods for producing NgR protein using the host cells of 
the invention. In one embodiment, the method comprises culturing the host cell of 
invention (into which a recombinant expression vector encoding NgR has been 
introduced) in a suitable medium such that NgR protein is produced. In another 
embodiment, the method further comprises isolating NgR from the medium or the host 
10 cell. 

In situations where the NgR polypeptide will be found primarily intracellularly, 
intracellular material (including inclusion bodies for Gram-negative bacteria) can be 
extracted from the host cell using any standard technique known to one of ordinary 
skill in the art. Such methods would encompass, by way of example and not by way of 

15 limitation, lysing the host cells to release the contents of the periplasm/cytoplasm by 
French press, homogenization and/or sonication followed by centrifiigation. 

If the NgR polypeptide has formed inclusion bodies in the cytosol, such 
inclusion bodies may frequently bind to the inner and/or outer cellular membranes. 
Upon centrifiigation, the inclusion bodies will be found primarily in the pellet material. 

20 The pellet material can then be treated at pH extremes or with one or more chaotropic 
agents such as a detergent, guanidine, guanidine derivatives, urea, or urea derivatives 
in the presence of a reducing agent such as dithiothreitol at alkaline pH or 
tris-carboxyethyl phosphine at acid pH to release, break apart and solubilize the 
inclusion bodies. Once solubilized, NgR polypeptide can be analyzed using gel 

25 electrophoresis, immunoprecipitation or the like. Various methods of isolating the 
NgR polypeptide would be apparent to one of ordinary skill in the art, for example, 
isolation may be accomplished using standard methods such as those set forth below 
and in Marston et al (1990) Meth. Enzymol. 182, 264-275 (incorporated by reference 
herein in its entirety). 

30 If isolated NgR polypeptide is not biologically active following the isolation 

procedure employed, various methods for "refolding" or converting the polypeptide to 
its tertiary structure and generating disulfide linkages, can be used to restore biological 
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activity. Methods known to one of ordinary skill in the art include adjusting the pH of 
the solubilized polypeptide to a pH usually above 7 and in the presence of a particular 
concentration of a chaotrope. The selection of chaotrope is very similar to the choices 
used for inclusion body solubilization but usually at a lower concentration and is not 
5 necessarily the same chaotrope as used for the solubilization. It may be required to 
employ a reducing agent or the reducing agent plus its oxidized form in a specific ratio, 
to generate a particular redox potential allowing for disulfide shuffling to occur in the 
formation of the protein's cysteine bridge(s). Some of the commonly used redox 
couples include cysteine/cystamine, glutathione (GSH)/dithiobis GSH, cupric chloride, 
10 dithiothreitol (DTT)/dithiane DTT, 2-mercaptoethanol (bME)/dithio-b(ME). To 

increase the efficiency of the refolding, it may be necessary to employ a cosolvent, such 
as glycerol, polyethylene glycol of various molecular weights and arginine. 

Transgenic Animals 

1 5 The host cells of the invention can also be used to produce non-human 

transgenic animals. For example, in one embodiment, a host cell of the invention is a 
fertilized oocyte or an embryonic stem cell into which NgR-coding sequences have 
been introduced. Such host cells can then be used to create non-human transgenic 
animals in which exogenous NgR sequences have been introduced into their genome or 

20 homologous recombinant animals in which endogenous NgR sequences have been 
altered. Such animals are useful for studying the function and/or activity of NgR and 
for identifying and/or evaluating modulators of NgR activity. As used herein, a 
"transgenic animal" is a non-human animal, preferably a mammal, more preferably a 
rodent such as a rat or mouse, in which one or more of the cells of the animal includes 

25 a transgene. Other examples of transgenic animals include non-human primates, sheep, 
dogs, cows, goats, chickens, amphibians, etc. A transgene is exogenous DNA that is 
integrated into the genome of a cell from which a transgenic animal develops and that 
remains in the genome of the mature animal, thereby directing the expression of an 
encoded gene product in one or more cell types or tissues of the transgenic animal. As 

30 used herein, a "homologous recombinant animal" is a non-human animal, preferably a 
mammal, more preferably a mouse, in which an endogenous NgR gene has been altered 
by homologous recombination between the endogenous gene and an exogenous DNA 
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molecule introduced into a cell of the animal, e.g., an embryonic cell of the animal, 
prior to development of the animal. 

A transgenic animal of the invention can be created by introducing NgR- 
encoding nucleic acid into the male pronuclei of a fertilized oocyte, e.g., by 
5 microinjection, retroviral infection, and allowing the oocyte to develop in a 

pseudopregnant female foster animal. The human NgR DNA sequence of SEQ ID 
NOs:l or 3 can be introduced as a transgene into the genome of a non-human animal. 
Alternatively, a nonhuman homolog of the human NgR gene, such as a mouse NgR 
gene, can be isolated based on hybridization to the human NgR cDNA (described 

10 further above) and used as a transgene. Intronic sequences and polyadenylation signals 
can also be included in the transgene to increase the efficiency of expression of the 
transgene. A tissue-specific regulatory sequence(s) can be operably linked to the NgR 
transgene to direct expression of NgR protein to particular cells. Methods for 
generating transgenic animals via embryo manipulation and microinjection, particularly 

15 animals such as mice, have become conventional in the art and are described, for 

example, in U.S. Pat. Nos. 4,736,866; 4,870,009; and 4,873,191; and Hogan 1986, in 
Manipulating the Mouse Embryo, Cold Spring Harbor Laboratory Press, Cold 
Spring Harbor, NY. Similar methods are used for production of other transgenic 
animals. A transgenic founder animal can be identified based upon the presence of the 

20 NgR transgene in its genome and/or expression of NgR mRNA in tissues or cells of the 
animals. A transgenic founder animal can then be used to breed additional animals 
carrying the transgene. Moreover, transgenic animals carrying a transgene encoding 
NgR can further be bred to other transgenic animals carrying other transgenes. 
* To create a homologous recombinant animal, a vector is prepared which 

25 contains at least a portion of a NgR gene into which a deletion, addition or substitution 
has been introduced to thereby alter, e.g., functionally disrupt, the NgR gene. The 
NgR gene can be a human gene (e.g., SEQ ID NOs:l or 13), but more preferably, is a 
non-human homolog of a human NgR gene. For example, a mouse homolog of human 
NgR gene of SEQ ID NOs: 1 or 13 can be used to construct a homologous 

30 recombination vector suitable for altering an endogenous NgR gene in the mouse 
genome. In one embodiment, the vector is designed such that, upon homologous 
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recombination, the endogenous NgR gene is functionally disrupted (i.e. , no longer 
encodes a functional protein; also referred to as a "knock out" vector). 

Alternatively, the vector can be designed such that, upon homologous 
recombination, the endogenous NgR gene is mutated or otherwise altered but still 
5 encodes functional protein (e.g. , the upstream regulatory region can be altered to 
thereby alter the expression of the endogenous NgR protein). In the homologous 
recombination vector, the altered portion of the NgR gene is flanked at its 5' and 3' 
ends by additional nucleic acid of the NgR gene to allow for homologous 
recombination to occur between the exogenous NgR gene carried by the vector and an 

10 endogenous NgR gene in an embryonic stem cell. The additional flanking NgR nucleic 
acid is of sufficient length for successful homologous recombination with the 
endogenous gene. Typically, several kilobases of flanking DNA (both at the 5 1 and 3 f 
ends) are included in the vector. See e.g. , Thomas et al (1987) Cell 51:503 for a 
description of homologous recombination vectors. The vector is introduced into an 

1 5 embryonic stem cell line (e.g. , by electroporation) and cells in which the introduced 
NgR gene has homologously recombined with the endogenous NgR gene are selected 
(see e.g., Li et al (1992) Cell 69:915). 

The selected cells are then injected into a blastocyst of an animal (e.g., a 
mouse) to form aggregation chimeras. See e.g., Bradley 1987, In: 

20 Teratocarcinomas and Embryonic Stem Cells: A Practical Approach, 

Robertson, ed. IRL, Oxford, pp. 113-152. A chimeric embryo can then be implanted 
into a suitable pseudopregnant female foster animal and the embryo brought to term. 
Progeny harboring the homologously recombined DNA in their germ cells can be used 
to breed animals in which all cells of the animal contain the homologously recombined 

25 DNA by germline transmission of the transgene. Methods for constructing 
homologous recombination vectors and homologous recombinant animals are 
described further in Bradley (1991) Curr. Opin. Biotechnol. 2:823-829; PCT 
International Publication Nos.: WO 90/1 1354; WO 91/01 140; WO 92/0968; and WO 
93/04169. 

30 In another embodiment, transgenic non-humans animals can be produced that 

contain selected systems that allow for regulated expression of the transgene. One 
example of such a system is the cre/loxP recombinase system of bacteriophage PI. For 
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a description of the cre/loxP recombinase system, see, e.g., Lakso et al (1992) Proc. 
Natl. Acad. Set. USA 89:6232-6236. Another example of a recombinase system is the 
FLP recombinase system of Saccharomyces cerevisiae (O'Gorman et al. (1991) 
Science 251:1351-1355. If a cre/loxP recombinase system is used to regulate 
5 expression of the transgene, animals containing transgenes encoding both the Cre 
recombinase and a selected protein are required. Such animals can be provided 
through the construction of "double" transgenic animals, e.g., by mating two 
transgenic animals, one containing a transgene encoding a selected protein and the 
other containing a transgene encoding a recombinase. 

10 Clones of the non-human transgenic animals described herein can also be 

produced according to the methods described in Wilmut et al. (1997) Nature 385:810- 
813. In brief, a cell, e.g., a somatic cell, from the transgenic animal can be isolated and 
induced to exit the growth cycle and enter G 0 phase. The quiescent cell can then be 
fused, e.g., through the use of electrical pulses, to an enucleated oocyte from an animal 

15 of the same species from which the quiescent cell is isolated. The reconstructed 
oocyte is then cultured such that it develops to morula or blastocyte and then 
transferred to pseudopregnant female foster animal. The offspring borne of this female 
foster animal will be a clone of the animal from which the cell, e.g., the somatic cell, is 
isolated. 

20 

Antisense 

Also provided by the invention are antisense polynucleotides that recognize and 
hybridize to NgR polynucleotides. Full-length and fragment antisense polynucleotides 
are provided. Fragment antisense molecules of the invention include (i) those that 

25 specifically recognize and hybridize to NgR RNA (as determined by sequence 
comparison of DNA encoding NgR to DNA encoding other known molecules). 
Identification of sequences unique to NgR encoding polynucleotides can be deduced 
through use of any publicly available sequence database, and/or through use of 
commercially available sequence comparison programs. After identification of the 

30 desired sequences, isolation through restriction digestion or amplification using any of 
the various polymerase chain reaction techniques well known in the art can be 
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performed. Antisense polynucleotides are particularly relevant to regulating 
expression of NgR by those cells expressing NgR mRNA. 

Antisense oligonucleotides, or fragments of a nucleotide sequence set forth in 
SEQ ID NO:l, 3, 13 or sequences complementary or homologous thereto, derived 
5 from the nucleotide sequences of the present invention encoding NgR are useful as 
diagnostic tools for probing gene expression in various tissues. For example, tissue 
can be probed in situ with oligonucleotide probes carrying detectable groups by 
conventional autoradiography techniques to investigate native expression of this 
enzyme or pathological conditions relating thereto. In specific aspects, antisense 

10 nucleic acid molecules are provided that comprise a sequence complementary to at 

least about 10, 25, 50, 100, 250 or 500 nucleotides or an entire NgR coding strand, or 
to only a portion thereof. Nucleic acid molecules encoding fragments, homologs, 
derivatives and analogs of a NgR protein of SEQ ID NO:2, 4 or 14 or antisense 
nucleic acids complementary to a NgR nucleic acid sequence of SEQ ID NOs: 1, 3 or 

15 1 3 are additionally provided. 

In one embodiment, an antisense nucleic acid molecule is antisense to a "coding 
region" of the coding strand of a nucleotide sequence encoding NgR. The term 
"coding region" refers to the region of the nucleotide sequence comprising codons 
which are translated into amino acid residues (e.g., the protein coding region of human 

20 NgR corresponds to the coding region SEQ ID NO: 1, 3 or 13). In another 

embodiment, the antisense nucleic acid molecule is antisense to a "noncoding region" 
of the coding strand of a nucleotide sequence encoding NgR The term "noncoding 
region" refers to 5' and 3 1 sequences which flank the coding region that are not 
translated into amino acids (i.e., also referred to as 5' and 3' untranslated regions). 

25 Antisense oligonucleotides are preferably directed to regulatory regions of a 

nucleotide sequence of SEQ ID NO:l, 3, 13 or mRNA corresponding thereto, 
including, but not limited to, the initiation codon, TATA box, enhancer sequences, and 
the like. Given the coding strand sequences encoding NgR disclosed herein (e.g., SEQ 
ID NO:l, 3 or 13), antisense nucleic acids of the invention can be designed according 

30 to the rules of Watson and Crick or Hoogsteen base pairing. The antisense nucleic 
acid molecule can be complementary to the entire coding region of NgR mRNA, but 
more preferably is an oligonucleotide that is antisense to only a portion of the coding 
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or noncoding region of NgR mRNA. For example, the antisense oligonucleotide can 
be complementary to the region surrounding the translation start site of NgR mRNA. 
An antisense oligonucleotide can be, for example, about 5, 10, 15, 20, 25, 30, 35, 40, 
45 or 50 nucleotides in length. An antisense nucleic acid of the invention can be 
5 constructed using chemical synthesis or enzymatic ligation reactions using procedures 
known in the art. For example, an antisense nucleic acid (e.g., an antisense 
oligonucleotide) can be chemically synthesized using naturally occurring nucleotides or 
variously modified nucleotides designed to increase the biological stability of the 
molecules or to increase the physical stability of the duplex formed between the 

10 antisense and sense nucleic acids, e.g., phosphorothioate derivatives and acridine 
substituted nucleotides can be used. 

Examples of modified nucleotides that can be used to generate the antisense 
nucleic acid include: 5-fluorouracil, 5-bromouracil, 5-chlorouracil, 5-iodouracil, 
hypoxanthine, xanthine, 4-acetylcytosine, 5-(carboxyhydroxylmethyl) uracil, 5- 

1 5 carboxymethylaminomethyl-2-thiouridine, 5-carboxymethylaminomethyluracil, 
dihydrouracil, beta-D-galactosylqueosine, inosine, N6-isopentenyladenine, 1- 
methylguanine, 1-methylinosine, 2,2-dimethylguanine, 2-methyladenine, 2- 
methylguanine, 3-methylcytosine, 5-methylcytosine, N6-adenine, 7-methylguanine, 5- 
methylaminomethyluracil, 5-methoxyaminomethyl-2-thiouracil, beta-D- 

20 mannosylqueosine, 5-methoxycarboxymethyluracil, 5-methoxyuracil, 2-methylthio- 
N6-isopentenyladenine, uracil-5-oxyacetic acid (v), wybutoxosine, pseudouracil, 
queosine, 2-thiocytosine, 5-methyl-2-thiouracil, 2-thiouracil, 4-thiouracil, 5- 
methyluracil, uracil-5-oxyacetic acid methylester, uracil-5-oxyacetic acid (v), 5-methyl- 
2-thiouracil, 3 -(3 -amino-3 -N-2-carboxypropyl) uracil, (acp3)w, and 2,6- 

25 diaminopurine. Alternatively, the antisense nucleic acid can be produced biologically 
using an expression vector into which a nucleic acid has been subcloned in an antisense 
orientation (i.e., RNA transcribed from the inserted nucleic acid will be of an antisense 
orientation to a target nucleic acid of interest, described further in the following 
subsection). 

30 The antisense nucleic acid molecules of the invention (preferably 

oligonucleotides of 10 to 20 nucleotides in length) are typically administered to a 
subject or generated in situ such that they hybridize with or bind to cellular mRNA 
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and/or genomic DNA encoding a NgR protein to thereby inhibit expression of the 
protein, e.g., by inhibiting transcription and/or translation. Suppression of NgR 
expression at either the transcriptional or translational level is useful to generate 
cellular or animal models for diseases/conditions characterized by aberrant NgR 
5 expression. The hybridization can be by conventional nucleotide complementarity to 
form a stable duplex, or, for example, in the case of an antisense nucleic acid molecule 
that binds to DNA duplexes, through specific interactions in the major groove of the 
double helix. 

Phosphorothioate and methylphosphonate antisense oligonucleotides are 

10 specifically contemplated for therapeutic use by the invention. The antisense 
oligonucleotides may be further modified by adding poly-L-lysine, transferrin 
polylysine or cholesterol moieties at their 5' end. 

An example of a route of administration of antisense nucleic acid molecules of 
the invention includes direct injection at a tissue site. Alternatively, antisense nucleic 

15 acid molecules can be modified to target selected cells and then administered 

systemically. For example, for systemic administration, antisense molecules can be 
modified such that they specifically bind to receptors or antigens expressed on a 
selected cell surface, e.g., by linking the antisense nucleic acid molecules to peptides or 
antibodies that bind to cell surface receptors or antigens. The antisense nucleic acid 

20 molecules can also be delivered to cells using the vectors described herein. To achieve 
sufficient intracellular concentrations of antisense molecules, vector constructs in 
which the antisense nucleic acid molecule is placed under the control of a strong pol II 
or pol HI promoter are preferred. 

In yet another embodiment, the antisense nucleic acid molecule of the invention 

25 is an a-anomeric nucleic acid molecule. An a-anomeric nucleic acid molecule forms 
specific double-stranded hybrids with complementary RNA in which, contrary to the 
usual p-units, the strands run parallel to each other (Gaultier et aL, (1987) Nucleic 
Acids Res. 15, 6625-6641). The antisense nucleic acid molecule can also comprise a 
^-o-methylribonucleotide (Inoue et al, (1987) Nucleic Acids Res. 15, 613 1-6148) or a 

30 chimeric RNA-DNA analogue (Inoue et aL, (1987) FEES Lett. 215, 327-330). 

The NgR sequences taught in the present invention facilitate the design of 
novel transcription factors for modulating NgR expression in native cells and animals, 
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and cells transformed or transfected with NgR polynucleotides. For example, the 
Cys 2 -His 2 zinc finger proteins, which bind DNA via their zinc finger domains, .have 
been shown to be amenable to structural changes that lead to the recognition of 
different target sequences. These artificial zinc finger proteins recognize specific target 
5 sites with high affinity and low dissociation constants, and are able to act as gene 
switches to modulate gene expression. Knowledge of the particular NgR target 
sequence of the present invention facilitates the engineering of zinc finger proteins 
specific for the target sequence using known methods such as a combination of 
structure-based modeling and screening of phage display libraries (Segal et aL, (1999) 

10 Proc. Natl Acad Sci. USA 96, 2758-2763; Liu et aL, (1997) Proc. Natl, Acad Set 

USA 94, 5525-5530; Greisman et aL (1997) Science 275, 657-661; Choo et aL, (1997) 
J. MoL Biol 273 , 525-532). Each zinc finger domain usually recognizes three or more 
base pairs. Since a recognition sequence of 18 base pairs is generally sufficient in 
length to render it unique in any known genome, a zinc finger protein consisting of 6 

15 tandem repeats of zinc fingers would be expected to ensure specificity for a particular 
sequence (Segal et aL, (1999), above). The artificial zinc finger repeats, designed 
based on the promoter of NgR sequences, are fused to activation or repression 
domains to promote or suppress NgR expression (Liu et aL, (1997), above). The 
promoter of NgR may be obtained by standard methods known to one of ordinary skill 

20 in the art with the disclosure contained herein and knowledge of the NgR sequence. 
Alternatively, the zinc finger domains can be fused to the TATA box-binding factor 
(TBP) with varying lengths of linker region between the zinc finger peptide and the 
TBP to create either transcriptional activators or repressors (Kim et aL, (1997) Proc. 
NatL Acad, Set USA 94, 3616-3620. Such proteins and polynucleotides that encode 

25 them, have utility for modulating NgR expression in vivo in both native cells, animals 
and humans; and/or cells transfected with NgR-encoding sequences. The novel 
transcription factor can be delivered to the target cells by transfecting constructs that 
express the transcription factor (gene therapy), or by introducing the protein. 
Engineered zinc finger proteins can also be designed to bind RNA sequences for use in 

30 therapeutics as alternatives to antisense or catalytic RNA methods (McColl et aL , 
(1997) Proc. NatL Acad Set USA 96, 9521-9526); Wu et aL, (1995) Proc. NatL 
Acad. Set USA 92, 344-348). The present invention contemplates methods of 
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designing such transcription factors based on the gene sequence of the invention, as 
well as customized zinc finger proteins, that are useful to modulate NgR expression in 
cells (native or transformed) whose genetic complement includes these sequences. 

5 Ribozymes and PNA moieties 

In still another embodiment, an antisense nucleic acid of the invention is a 
ribozyme. Ribozymes are catalytic RNA molecules with ribonuclease activity that are 
capable of cleaving a single-stranded nucleic acid, such as an mRNA, to which they 
have a complementary region. Thus, ribozymes (e.g., hammerhead ribozymes, 

10 described in Haselhoff and Gerlach (1988) Nature 334, 585-591) can be used to 
catalytically cleave NgR mRNA transcripts to thereby inhibit translation of NgR 
mRNA. A ribozyme having specificity for a NgR-encoding nucleic acid can be 
designed based upon the nucleotide sequence of a NgR DNA disclosed herein (i.e., 
SEQ ID NOs: 1, 3 or 13). For example, a derivative of a Tetrahymena L-19 IVS RNA 

15 can be constructed in which the nucleotide sequence of the active site is 

complementary to the nucleotide sequence to be cleaved in a NgR-encoding mRNA 
See, e.g., Cech etal U.S. Patent No. 4,987,071; and Cech etal U.S. Patent No. 
5, 1 16,742. Alternatively, NgR mRNA can be used to select a catalytic RNA having a 
specific ribonuclease activity from a pool of RNA molecules. See, e.g. , Bartel et al. , 

20 (1993) Science 261, 1411-1418. 

Alternatively, NgR gene expression can be inhibited by targeting nucleotide 
sequences complementary to the regulatory region of the NgR (e.g. , the NgR 
promoter and/or enhancers) to form triple helical structures that prevent transcription 
of the NgR gene in target cells. See generally, Helene (1991) Anticancer Drug Des. 6: 

25 569-584; Helene. et al, (1992) Ann. N.Y. Acad Sci. 660:27-36; and Maher (1992) 
BioEssayslA, 807-815. 

In various embodiments, the nucleic acids of NgR can be modified at the base 
moiety, sugar moiety or phosphate backbone to improve, e.g., the stability, 
hybridization, or solubility of the molecule. For example, the deoxyribose phosphate 

30 backbone of the nucleic acids can be modified to generate peptide nucleic acids (see 
Hyrup et al., (1996) Bioorg. Med Chem. Lett. 4, 5-23). As used herein, the terms 
"peptide nucleic acids" or "PNAs 11 refer to nucleic acid mimics, e.g., DNA mimics, in 
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which the deoxyribose phosphate backbone is replaced by a pseudopeptide backbone 
and only the four natural nucleobases are retained. The neutral backbone of PNAs has 
been shown to allow for specific hybridization to DNA and RNA under conditions of 
low ionic strength. The synthesis of PNA oligomers can be performed using standard 
5 solid phase peptide synthesis protocols as described in Hyrup et al, (1996) above; 
Perry-O'Keefe et al, (1996) Proc. Natl Acad Set USA 93,14670-14675. 

PNAs of NgR can be used in therapeutic and diagnostic applications. For 
example, PNAs can be used as antisense or antigene agents for sequence-specific 
modulation of gene expression by, e.g., inducing transcription or translation arrest or 

10 inhibiting replication. PNAs of NgR can also be used, e.g., in the analysis of single 
base pair mutations in a gene by, e.g., PNA directed PCR clamping; as artificial 
restriction enzymes when used in combination with other enzymes, e.g., SI nucleases 
(Hyrup (1996), above); or as probes or primers for DNA sequence and hybridization 
(Hyrup eta!., (1996), above; Perry-O'Keefe (1996), above). 

15 In another embodiment, PNAs of NgR can be modified, e.g. , to enhance their 

stability or cellular uptake, by attaching lipophilic or other helper groups to PNA, by 
the formation of PNA-DNA chimeras, or by the use of liposomes or other techniques 
of drug delivery known in the art. For example, PNA-DNA chimeras of NgR can be 
generated that may combine the advantageous properties of PNA and DNA Such 

20 chimeras allow DNA recognition enzymes, e.g., RNase H and DNA polymerases, to 
interact with the DNA portion while the PNA portion would provide high binding 
affinity and specificity. PNA-DNA chimeras can be linked using linkers of appropriate 
lengths selected in terms of base stacking, number of bonds between the nucleobases, 
and orientation (Hyrup (1996), above). The synthesis of PNA-DNA chimeras can be 

25 performed as described in Hyrup (1996), above and Finn et al (1996) Nucleic Acids 
Res. 24, 3357-3363. For example, a DNA chain can be synthesized on a solid support 
using standard phosphoramidite coupling chemistry, and modified nucleoside analogs, 
e .g^ 5-(4-methoxytrityl) amino-S'-deoxy-thymidine phosphoramidite, can be used 
between the PNA and the 5' end of DNA (Mag et al (1989) Nucleic Acids Res. 17, 

30 973-988). PNA monomers are then coupled in a stepwise manner to produce a 

chimeric molecule with a 5 1 PNA segment and a 3' DNA segment (Finn et al. (1996), 
above). Alternatively, chimeric molecules can be synthesized with a 5' DNA segment 
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and a 3' PNA segment. See, Petersen et al (1975) Bioorg. Med. Chem. Lett. 5:1119- 
1124. 

In other embodiments, the oligonucleotide may include other appended groups 
such as peptides {e.g., for targeting host cell receptors in vivo), or agents facilitating 
5 transport across the cell membrane (see Letsinger et al, (1989) Proc. Natl Acad. Set 
USA 86, 6553-6556; Lemaitre et aL, (1987) Proa Natl. Acad. Sci. USA 84, 648-652; 
PCT Publication No. WO 88/09810) or the blood-brain barrier (see, e.g., PCT 
Publication No. WO 89/10134). In addition, oligonucleotides can be modified with 
hybridization triggered cleavage agents (see, e.g., Krol et al, (1988) Biotechniques 6, 
10 958-976) or intercalating agents (see, e.g., Zon (1988) Pharm. Res. 5, 539-549). To 
this end, the oligonucleotide may be conjugated to another molecule, e.g., a peptide, a 
hybridization triggered cross-linking agent, a transport agent, a hybridization-triggered 
cleavage agent, etc. 

Automated sequencing methods can be used to obtain or verify the nucleotide 
1 5 sequence of NgR. The NgR nucleotide sequences of the present invention are believed 
to be 100% accurate. However, as is known in the art, nucleotide sequence obtained 
by automated methods may contain some errors. Nucleotide sequences determined by 
automation are typically at least about 90%, more typically at least about 95% to at 
least about 99.9% identical to the actual nucleotide sequence of a given nucleic acid 
20 molecule. The actual sequence may be more precisely determined using manual 

sequencing methods, which are well known in the art. An error in a sequence which 
results in an insertion or deletion of one or more nucleotides may result in a frame shift 
in translation such that the predicted amino acid sequence will differ from that which 
would be predicted from the actual nucleotide sequence of the nucleic acid molecule, 
25 starting at the point of the mutation. 

Polypeptides 

The invention also provides purified and isolated mammalian NgR polypeptides 
encoded by a polynucleotide of the invention. Presently preferred is a human NgR 
30 polypeptide comprising the amino acid sequence set forth in SEQ ID NO:2 or SEQ ID 
NO: 14. Another preferred embodiment is a mouse NgR polypeptide comprising the 
amino acid sequence of NgR3, as set forth in SEQ ID NO:4. 
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One aspect of the invention pertains to isolated NgR proteins, and biologically 
active portions thereof, or derivatives, fragments, analogs or homologs thereof. Also 
provided are polypeptide fragments suitable for use as immunogens to raise anti-NgR 
antibodies. Preferably, fragments of NgR proteins comprise at least one biological 
5 activity of NgR. In one embodiment, native NgR proteins can be isolated from cells or 
tissue sources by an appropriate purification scheme using standard protein purification 
techniques. In another embodiment, NgR proteins are produced by recombinant DNA 
techniques. Alternative to recombinant expression, a NgR protein or polypeptide can 
be synthesized chemically using standard peptide synthesis techniques. 

10 The invention also embraces polypeptides that have at least 99%, at least 95%, 

at least 90%, at least 85%, at least 80%, at least 75%, at least 70%, at least 65%, at 
least 60%, at least 55%, at least 50% or at least 45% identity and/or homology to the 
preferred polypeptide of the invention. In addition, the invention embraces 
polypeptides having the consensus sequence shown in SEQ ID NO: 6, shown in Table 

15 5) excluding the previously characterized NgR ("NgRl "), and polypeptides comprising 
at least about 90% of the consensus sequence. 

The term "percentage of sequence identity" is calculated by comparing two 
optimally aligned sequences over that region of comparison, determining the number of 
positions at which the identical nucleic acid base (e.g., A, T, C, G, U, or I, in the case 

20 of nucleic acids) occurs in both sequences to yield the number of matched positions, 
dividing the number of matched positions by the total number of positions in the region 
of comparison (i.e., the window size), and multiplying the result by 100 to yield the 
percentage of sequence identity. The term "substantial identity" as used herein denotes 
a characteristic of a polynucleotide sequence, wherein the polynucleotide comprises a 

25 sequence that has at least 80 percent sequence identity, preferably at least 85 percent 
identity and often 90 to 95 percent sequence identity, more usually at least 99 percent 
sequence identity as compared to a reference sequence over a comparison region. 

In one aspect, percent homology is calculated as the percentage of amino acid 
residues in the smaller of two sequences which align with identical amino acid residue 

30 in the sequence being compared, when four gaps in a length of 100 amino acids may be 
introduced to maximize alignment (DayhofF, in ATLAS OF PROTEIN SEQUENCE AND 
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Structure, Vol. 5, p. 124, National Biochemical Research Foundation, Washington, 
D.C. (1972), incorporated herein by reference). 

A determination of homology or identity is typically made by a computer 
homology program known in the art. An exemplary program is the Gap program 
5 (Wisconsin Sequence Analysis Package, Version 8 for UNIX, Genetics Computer 
Group, University Research Park, Madison, WI) using the default settings, which uses 
the algorithm of Smith and Waterman (Adv. AppL Math, 1981, 2, 482-489, which in 
incorporated herein by reference in its entirety). Employing the GAP software 
provided in the GCG program package, (see Needleman and Wunsch (1970) 1 Mol 

1 0 Biol 48, 443-453) the following settings for nucleic acid sequence comparison may be 
used: GAP creation penalty of 5.0 and GAP extension penalty of 0.3, the coding 
region of the analogous nucleic acid sequences referred to above exhibits a degree of 
identity preferably of at least 70%, 75%, 80%, 85%, 90%, 95%, 98%, or 99%, with 
the CDS (encoding) part of the DNA sequence shown in SEQ ID NOs:l, 3 or 13. 

1 5 BestFit was originally written for Version 1 .0 by Paul Haeberli from a careful reading 
of the papers by Needleman and Wunsch (1970), above, and Smith and Waterman 
(1981), above. The following Bestfit settings for nucleic acid sequence comparison 
may be used: GAP creation penalty of 8.0 and GAP extension penalty of 2, the coding 
region of the analogous nucleic acid sequences referred to above exhibits a degree of 

20 identity preferably of at least 70%, 75%, 80%, 85%, 90%, 95%, 98% or 99%, with the 
CDS (encoding) part of the amino acid sequence shown in SEQ ID NOs:2, 4 or 14. 

Alternatively, homology may be determined by hybridization analysis wherein a 
nucleic acid sequence is hybridized to the complement of a sequence encoding the 
aforementioned proteins under stringent, moderately stringent, or low stringent 

25 conditions. See e.g. Ausubel, etal, (Eds.) Current Prqtocols in Molecular 
Biology, John Wiley & Sons, New York, NY, 1993, and below. 

Polypeptides of the invention may be isolated from natural cell sources or may 
be chemically synthesized, but are preferably produced by recombinant procedures 
involving host cells of the invention. 

30 An "isolated" or "purified" protein or biologically active portion thereof is 

substantially free of cellular material or other contaminating proteins from the cell or 
tissue source from which the NgR protein is derived, or substantially free from 
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chemical precursors or other chemicals when chemically synthesized. The language 
"substantially free of cellular material" includes preparations of NgR protein in which 
the protein is separated from cellular components of the cells from which it is isolated 
or recombinantly produced. In one embodiment, the language "substantially free of 
5 cellular material" includes preparations of NgR protein having less than about 30% (by 
dry weight) of non-NgR protein (also referred to herein as a "contaminating protein"), 
more preferably less than about 20% of non-NgR protein, still more preferably less 
than about 10% of non-NgR protein, and most preferably less than about 5% non-NgR 
protein. When the NgR protein or biologically active portion thereof is recombinantly 

10 produced, it is also preferably substantially free of culture medium, i.e., culture 

medium represents less than about 20%, more preferably less than about 10%, and 
most preferably less than about 5% of the volume of the protein preparation. 

The language "substantially free of chemical precursors or other chemicals" 
includes preparations of NgR protein in which the protein is separated from chemical 

1 5 precursors or other chemicals that are involved in the synthesis of the protein. In one 
embodiment, the language "substantially free of chemical precursors or other 
chemicals" includes preparations of NgR protein having less than about 30% (by dry 
weight) of chemical precursors or non-NgR chemicals, more preferably less than about 
20% chemical precursors or non-NgR chemicals, still more preferably less than about 

20 10% chemical precursors or non-NgR chemicals, and most preferably less than about 
5% chemical precursors or non-NgR chemicals. 

Biologically active portions of a NgR protein include peptides comprising 
amino acid sequences sufficiently homologous to or derived from the amino acid 
sequence of the NgR protein, e.g., the amino acid sequence shown in SEQ ID NO:2, 4 

25 or 14 that include fewer amino acids than the full length NgR proteins, and exhibit at 
least one activity of a NgR protein. Typically, biologically active portions comprise a 
domain or motif with at least one activity of the NgR protein. A biologically active 
portion of a NgR protein can be a polypeptide which is, for example, 10, 25, 50, 100 
or more amino acids in length. 

30 A biologically active portion of a NgR protein of the present invention may 

contain at least one of the features that is conserved between the NgR proteins (e.g., a 
conserved cysteine as the N-terminus of the mature protein, four conserved cysteines 
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in the N-terminus before a leucine-rich region, four conserved cysteines C-terminal 
with respect to a leucine repeat region, eight leucine-rich repeats, and a hydrophobic 
C-terminus). An alternative biologically active portion of a NgR protein may contain 
at least two of the above-identified domains. Another biologically active portion of a 
5 NgR protein may contain at least three of the above-identified domains. Yet another 
biologically active portion of a NgR protein of the present invention may contain at 
least four of the above-identified domains. 

Moreover, other biologically active portions, in which other regions of the 
protein are deleted, can be prepared by recombinant techniques and evaluated for one 

10 or more of the functional activities of a native NgR protein. 

In an embodiment, the NgR protein has an amino acid sequence shown in SEQ 
ID NO:2, 4 or 14. In other embodiments, the NgR protein is substantially homologous 
to SEQ ID NO:2, 4 or 14 and retains the functional activity of the protein of SEQ ID 
NO:2, 4 or 14, yet differs in amino acid sequence due to natural allelic variation or 

1 5 mutagenesis, as described in detail below. 

Accordingly, in another embodiment, the NgR protein is a protein that 
comprises an amino acid sequence at least about 45% homologous to the amino acid 
sequence of SEQ ID NO:2 or SEQ ID NO:4 or SEQ ID NO: 14 and retains the 
functional activity of the NgR proteins of SEQ ID NO:2, 4 or 14. 

20 Use of mammalian host cells is expected to provide for such post-translational 

modifications (e.g., glycosylation, truncation, lipidation and phosphorylation) as may 
be needed to confer optimal biological activity on recombinant expression products of 
the invention. Glycosylated and non-glycosylated forms of NgR polypeptides are 
embraced by the invention. 

25 The invention also embraces variant (or analog) NgR polypeptides. In one 

example, insertion variants are provided wherein one or more amino acid residues 
supplement a NgR amino acid sequence. Insertions may be located at either or both 
termini of the protein, or may be positioned within internal regions of the NgR amino 
acid sequence. Insertional variants with additional residues at either or both termini 

30 can include, for example, fusion proteins and proteins including amino acid tags or 
labels. 
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Insertion variants include NgR polypeptides wherein one or more amino acid 
residues are added to a NgR acid sequence or to a biologically active fragment thereof. 

Variant products of the invention also include mature NgR products, i.e., NgR 
products wherein leader or signal sequences are removed, with additional amino 
5 terminal residues. The additional amino terminal residues may be derived from another 
protein, or may include one or more residues that are not identifiable as being derived 
from specific proteins. NgR products with an additional methionine residue at 
position -1 (Met _1 -NgR) are contemplated, as are variants with additional methionine 
and lysine residues at positions -2 and -1 (Met^-Lys^-NgR). Variants of NgR with 
10 additional Met, Met-Lys, Lys residues (or one or more basic residues in general) are 
particularly useful for enhanced recombinant protein production in bacterial host cells. 

Polypeptide Variants 

The invention also embraces NgR variants having additional amino acid 

15 residues which result from use of specific expression systems. 

As used herein, a NgR "chimeric protein" or "fusion protein" comprises a NgR 
polypeptide operatively linked to a non-NgR polypeptide. A "NgR polypeptide" refers 
to a polypeptide having an amino acid sequence corresponding to NgR, whereas a 
"non-NgR polypeptide" refers to a polypeptide having an amino acid sequence 

20 corresponding to a protein that is not homologous to the NgR protein, e.g., a protein 
that is different from the NgR protein and that is derived from the same or a different 
organism. Within a NgR fusion protein the NgR polypeptide can correspond to all or a 
portion of a NgR protein. In one embodiment, a NgR fusion protein comprises at least 
one biologically active portion of a NgR protein. In another embodiment, a NgR 

25 fusion protein comprises at least two biologically active portions of a NgR protein. In 
yet another embodiment, a NgR fusion protein comprises at least three biologically 
active portions of a NgR protein. Within the fusion proteb, the term "operatively 
linked" is intended to indicate that the NgR polypeptide and the non-NgR polypeptide 
are fused in-frame to each other. The non-NgR polypeptide can be fused to the N- 

3 0 terminus or C-terminus of the NgR polypeptide. 

For example, in one embodiment a NgR fusion protein comprises a NgR 
domain operably linked to the extracellular domain of a second protein. Such fusion 
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proteins can be further utilized in screening assays for compounds which modulate 
NgR activity (such assays are described in detail below). 

For example, use of commercially available vectors that express a desired 
polypeptide as part of a glutathione-S-transferase (GST) fusion product provides the 
5 desired polypeptide having an additional glycine residue at position -1 after cleavage of 
the GST component from the desired polypeptide. 

In another embodiment, the fusion protein is a NgR protein containing a 
heterologous signal sequence at its N-terminus. For example, the native NgR signal 
sequence (i.e., amino acids 1-30 of SEQ ID NO:2 and amino acids 1-40 of SEQ ID 

10 NO:4) can be removed and replaced with a signal sequence from another protein. In 
certain host cells {e.g., mammalian host cells), expression and/or secretion NgR can be 
increased through use of a heterologous signal sequence. 

In yet another embodiment, the fusion protein is a NgR-immunoglobulin fusion 
protein in which the NgR sequences comprising one or more domains are fused to 

1 5 sequences derived from a member of the immunoglobulin protein family. The NgR- 
immunoglobulin fusion proteins of the invention can be incorporated into 
pharmaceutical compositions and administered to a subject to inhibit an interaction 
between NgR ligand and a NgR protein on the surface of a cell, to thereby suppress 
NgR-mediated signal transduction in vivo. NgR-immunoglobulin fusion proteins can be 

20 used to affect the bioavailability of a NgR cognate ligand. Inhibition of the NgR 
ligand/NgR interaction may be useful therapeutically for both the treatment of 
proliferative and differentiative disorders, as well as modulating (e.g., promoting or 
inhibiting) cell survival. Moreover, the NgR-immunoglobulin fusion proteins of the 
invention can be used as immunogens to produce anti-NgR antibodies in a subject, to 

25 purify NgR ligands, and in screening assays to identify molecules that inhibit the 
interaction of NgR with NgR ligand. 

A NgR chimeric or fusion protein of the invention can be produced by standard 
recombinant DNA techniques. For example, DNA fragments coding for the different 
polypeptide sequences are ligated together in-frame in accordance with conventional 

30 techniques, e.g., by employing blunt-ended or stagger-ended termini for ligation, 

restriction enzyme digestion to provide for appropriate termini, filling-in of cohesive 
ends as appropriate, alkaline phosphatase treatment to avoid undesirable joining and 
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enzymatic ligation. In another embodiment, the fusion gene can be synthesized by 
conventional techniques including automated DNA synthesizers. Alternatively, PCR 
amplification of gene fragments can be carried out using anchor primers that give rise 
to complementary overhangs between two consecutive gene fragments that can 
5 subsequently be annealed and reamplified to generate a chimeric gene sequence (see, 
for example, Ausubel etal (Eds.) Current Protocols in Molecular Biology, 
John Wiley & Sons, 1992). Moreover, many expression vectors are commercially 
available that already encode a fusion moiety (e.g., a GST polypeptide). ANgR- 
encoding nucleic acid can be cloned into such an expression vector such that the fusion 
1 0 moiety is linked in-frame to the NgR protein. 

Variants resulting from expression in other vector systems are also 
contemplated. 

Insertional variants also include fusion proteins wherein the amino terminus 
and/or the carboxy terminus of NgR is/are fused to another polypeptide. 

15 In another aspect, the invention provides deletion variants wherein one or more 

amino acid residues in a NgR polypeptide are removed. Deletions can be effected at 
one or both termini of the NgR polypeptide, or with removal of one or more 
non-terminal amino acid residues of NgR. Deletion variants, therefore, include all 
fragments of a NgR polypeptide. 

20 The invention also embraces polypeptide fragments of the sequence set forth in 

SEQ ID NO:2, 4 or 14 wherein the fragments maintain biological (e.g., ligand binding 
and/or intracellular signaling) immunological properties of a NgR polypeptide. 
Fragments comprising at least 4, 5, 10, 15, 20, 25, 30, 35, or 40 consecutive amino 
acids of SEQ ID NO:2, 4 or 14 are contemplated by the invention. Preferred 

25 polypeptide fragments display antigenic properties unique to, or specific for, human 
NgR and its allelic and species homologs. Fragments of the invention having the 
desired biological and immunological properties can be prepared by any of the methods 
well known and routinely practiced in the art. 

In still another aspect, the invention provides substitution variants of NgR 

30 polypeptides. Substitution variants include those polypeptides wherein one or more 
amino acid residues of a NgR polypeptide are removed and replaced with alternative 
residues. In one aspect, the substitutions are conservative in nature; however, the 
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invention embraces substitutions that are also non-conservative. Conservative 
substitutions for this purpose may be defined as set out in Tables 2, 3, or 4 below. 
Table 1. 
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Variant polypeptides include those wherein conservative substitutions have 
been introduced by modification of polynucleotides encoding polypeptides of the 
invention. Amino acids can be classified according to physical properties and 
1 5 contribution to secondary and tertiary protein structure. A conservative substitution is 
recognized in the art as a substitution of one amino acid for another amino acid that 
has similar properties. Exemplary conservative substitutions are set out in Table 2 
(from WO 97/09433, page 10, published March 13, 1997 (PCT/GB96/02197, filed 
9/6/96), immediately below. 

20 

Table 2 
Conservative Substitutions I 
SIDE CHAIN 

CHARACTERISTIC AMINO ACID 



Aliphatic 
Non-polar 



GAP 
ILV 
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Polar - uncharged C S T M 

NQ 

Polar - charged D E 

KR 

Aromatic HFWY 

Other NQDE 
5 Alternatively, conservative amino acids can be grouped as described in 

Lehninger, [BIOCHEMISTRY, Second Edition; Worth Publishers, Inc. NY, NY 

(1975), pp.71-77] as set out in Table 3, immediately below. 

Table 3 

Conservative Substitutions n 

10 



15 



SIDE CHAIN 




CHARACTERISTIC 


AMINO ACID 


Non-polar (hydrophobic) 




A Aliphatic: 


ALIVP 


. B. Aromatic: 


FW 


C. Sulfur-containing: 
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D. Boderline: 
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Uncharged-polar 




A Hydroxyl: 


STY 


B. Amides: 


NQ 


C. Sylfhydryl: 
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D. Boderline: 
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Positively Charged (Basic): 


KRH 


Negatively Charged (Acidic): 


DE 



20 



As still another alternative, exemplary conservative substitutions are set out in 
25 Table 4, below. 

Table 4 

Conservative Substitutions HI 

Original Exemplary Substitution 
Residue 

Ala (A) Val, Leu, lie 

30 Arg (R) Lys, Gin, Asn 

Asn (N) Gin, His, Lys, Arg 

Asp (D) Glu 

Cys (C) Ser 

Gin (Q) Asn 

35 Glu (E) Asp 
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His (H) 


Asn, Gin, Lys, Arg 


ne© 


Leu, Val, Met, Ala, Phe, 


Leu (L) 


He, Val, Met, Ala, Phe 


Lys(K) 


Arg, Gin, Asn 


Met(M) 


Leu, Phe, lie 


Phe(F) 


Leu, Val, lie, Ala 


Pro(P) 


Gly 


Ser(S) 


Thr 


Thr(T) 


Ser 


Trp(W) 


Tyr 


Tyr(Y) 


Trp, Phe, Thr, Ser 


Val(V) 


He, Leu, Met, Phe, Ala 



In addition, amino acid residues that are conserved among family members of 
1 5 the NgR proteins of the present invention, as indicated by the alignment presented 
herein, are also predicted to be particularly unamenable to alteration. For example, 
NgR proteins of the present invention can contain at least one domain that is a typically 
conserved region in NgRs. Examples of these conserved domains include, e.g., 
leucine-rich repeat domain. Amino acid residues that are not conserved or are only 
20 semi-conserved among members of the NgR proteins may be readily amenable to 
alteration. 

Full-length NgRs have an LRR region characterized by the amino acid 
consensus sequence shown in SEQ ID NO: 19. At least some full-length NgRs also 
include a CT signaling (CTS) domain and a GPI domain. 
25 The NgR domain designations used herein are defined as follows: 
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131-154 
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142-165 
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LRR5 


155-178 


155-178 


156-179 
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166-189 
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125-148 
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439-462 



10 

In some embodiments of the invention, the above domains are modified. 
Modification can be in a manner that preserves domain functionality. Modification can 
include addition, deletion or substitution of certain amino acids. Exemplary 
modifications include conservative amino acid substitutions. Preferably such 

15 substitutions number 20 or fewer per 100 residues. More preferably, such 

substitutions number 10 or fewer per 100 residues. Further exemplary modifications 
include addition of flanking sequences of up to five amino acids at the N terminus 
and/or C terminus of one or more of the domains. 

In some embodiments, the isolated nucleic acid molecule encodes a polypeptide 

20 at least about 70%, 80%, 90%, 95%, 98%, and most preferably at least about 99% 
homologous to SEQ ID NO:2, 4 or 14. 

Mutations can be introduced into SEQ ID NOS:l, 3 or 13 by standard 
techniques, e.g., site-directed mutagenesis and PCR-mediated mutagenesis. 
Conservative amino acid substitutions can be made at one or more amino acid residues 

25 predicted to be non-essential, Alternatively, mutations can be introduced randomly 
along a NgR coding sequence. This can be accomplished, e.g., by saturation 
mutagenesis. The resulting mutants can be screened for NgR biological activity. 
Biological activities of NgR may include but are not limited to: (1) protein:protein 
interactions, e.g., with other NgRs or other cell-surface proteins involved in Nogo- 

30 related signaling; (2) complex formation with a NgR ligand; (3) binding to an anti- 
NgR antibody. 



WO 02/29059 



PCT7US01/31488 



-59- 

It should be understood that the definition of polypeptides of the invention is 
intended to include polypeptides bearing modifications other than insertion, deletion, 
or substitution of amino acid residues. By way of example, the modifications may be 
covalent in nature, and include for example, chemical bonding with polymers, lipids, 
5 other organic and inorganic moieties. Such derivatives may be prepared to increase 
circulating half-life of a polypeptide, or may be designed to improve the targeting 
capacity of the polypeptide for desired cells, tissues or organs. Similarly, the invention 
further embraces NgR polypeptides that have been covalently modified to include one 
or more water-soluble polymer attachments such as polyethylene glycol, 

10 poly oxy ethylene glycol or polypropylene glycol. Variants that display ligand binding 
properties of native NgR and are expressed at higher levels, as well as variants that 
provide for constitutively active receptors, are particularly useful in assays of the 
invention; the variants are also useful in providing cellular, tissue and animal models of 
diseases/conditions characterized by aberrant NgR activity. 

1 5 Chemically modified NgR polypeptide compositions in which the NgR 

polypeptide is linked to a polymer are included within the scope of the present 
invention. The polymer may be water soluble to prevent precipitation of the protein in 
an aqueous environment, such as a physiological environment. Suitable water-soluble 
polymers may be selected from the group consisting of, for example, polyethylene 

20 glycol (PEG), monomethoxypolyethylene glycol, dextran, cellulose, or other 
carbohydrate based polymers, poly-(N-vinyl pyrrolidone) polyethylene glycol, 
polypropylene glycol homopolymers, a polypropylene oxide/ethylene oxide copolymer 
polyoxyethylated polyols (e.g. glycerol) and polyvinyl alcohol. The selected polymer is 
usually modified to have a single reactive group, such as an active ester for acylation or 

25 an aldehyde for alkylation, so that the degree of polymerization may be controlled. 
Polymers may be of any molecular weight, and may be branched or unbranched, and 
mixtures of such polymers may also be used. When the chemically modified NgR 
polymer is destined for therapeutic use, pharmaceutically acceptable polymers will be 
selected for use. 

30 When the polymer is to be modified by an acylation reaction, the polymer 

should have a single reactive ester group. Alternatively, if the polymer is to be 
modified by reductive alkylation, the polymer should have a single reactive aldehyde 
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group. A preferred reactive aldehyde is polyethylene glycol propionaldehyde, which is 
water stable, or mono Cl-CIO alkoxy or aryloxy derivatives thereof (see U.S. Patent 
No. 5,252,714, incorporated by reference herein in its entirety). 

Pegylation of NgR polypeptides may be carried out by any of the pegylation 
5 reactions known in the art, as described, for example, in the following references: 
Focus on Growth Factors!, 4-10 (1992); EP 0 154 316; andEP 0 401 384 (each of 
which is incorporated by reference herein in its entirety). Preferably, the pegylation is 
carried out via an acylation reaction or an alkylation reaction with a reactive 
polyethylene glycol molecule (or an analogous reactive water-soluble polymer). A 

10 preferred water-soluble polymer for pegylation of polypeptides such as NgR is 
polyethylene glycol (PEG). As used herein, "polyethylene glycol" is meant to 
encompass any of the forms of PEG that have been used to derivatize other proteins, 
such as mono (Cl-CIO) alkoxy- or aryloxy-polyethylene glycol. 

Chemical derivatization of NgR polypeptides may be performed under any 

1 5 suitable conditions used to react a biologically active substance with an activated 

polymer molecule. Methods for preparing pegylated NgR polypeptides will generally 
comprise the steps of (a) reacting the polypeptide with polyethylene glycol, such as a 
reactive ester or aldehyde derivative of PEG, under conditions whereby NgR 
polypeptide becomes attached to one or more PEG groups, and (b) obtaining the 

20 reaction products. It will be apparent to one of ordinary skill in the art to select the 
optimal reaction conditions or the acylation reactions based on known parameters and 
the desired result. 

Pegylated and other polymenNgR polypeptides may generally be used to treat 
conditions that may be alleviated or modulated by administration of the NgR 

25 polypeptides described herein. However, the chemically-derivatized polymenNgR 
polypeptide molecules disclosed herein may have additional activities, enhanced or 
reduced biological activity, or other characteristics, such as increased or decreased 
half-life, as compared to the nonderivatized molecules. The NgR polypeptides, 
fragments thereof variants and derivatives, may be employed alone, together, or in 

30 combination with other pharmaceutical compositions. The cytokines, growth factors, 
antibiotics, antiinflammatories and/or chemotherapeutic agents as is appropriate for the 
indication being treated. 
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The present invention provides compositions comprising purified polypeptides 
of the invention. Preferred compositions comprise, in addition to the polypeptide of 
the invention, a pharmaceutical^ acceptable (i.e. 7 sterile and non-toxic) liquid, 
semisolid, or solid diluent that serves as a pharmaceutical vehicle, excipient or medium, 
5 Any diluent known in the art may be used. Exemplary diluents include, but are not 
limited to, water, saline solutions, polyoxyethylene sorbitan monolaurate, magnesium 
stearate, methyl- and propylhydroxybenzoate, talc, alginates, starches, lactose, sucrose, 
dextrose, sorbitol, mannitol, glycerol, calcium phosphate, mineral oil and cocoa butter. 
Variants that display ligand binding properties of native NgR and are expressed 

10 at higher levels, as well as variants that provide for constitutively active receptors, are 
particularly useful in assays of the invention; the variants are also useful in assays of the 
invention and in providing cellular, tissue and animal models of diseases/conditions 
characterized by aberrant NgR activity. 

With the knowledge of the nucleotide sequence information disclosed in the 

1 5 present invention, one skilled in the art can identify and obtain nucleotide sequences 
which encode NgR from different sources (i.e., different tissues or different organisms) 
through a variety of means well known to the skilled artisan and as disclosed by, for 
example, Sambrook etal, MOLECULAR CLONING: A LABORATORY MANUAL, Second 
Edition, Cold Spring Harbor Press, Cold Spring Harbor, NY (1989), which is 

20 incorporated herein by reference in its entirety. 

For example, DNA that encodes NgR may be obtained by screening of mRNA, 
cDNA, or genomic DNA with oligonucleotide probes generated from the NgR gene 
sequence information provided herein. Probes may be labeled with a detectable group, 
such as a fluorescent group, a radioactive atom or a chemiluminescent group in 

25 accordance with procedures known to the skilled artisan and used in conventional 
hybridization assays, as described by, for example, Sambrook et al (1989) above. 

A nucleic acid molecule comprising any of the NgR nucleotide sequences 
described above can alternatively be synthesized by use of the polymerase chain 
reaction (PCR) procedure, with the PCR oligonucleotide primers produced from the 

30 nucleotide sequences provided herein. See U.S. Patent Nos. 4,683, 195 to Mullis et al 
and 4,683,202 to Mullis. The PCR reaction provides a method for selectively 
increasing the concentration of a particular nucleic acid sequence even when that 
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sequence has not been previously purified and is present only in a single copy in a 
particular sample. The method can be used to amplify either single- or 
double-stranded DNA. The essence of the method involves the use of two 
oligonucleotide probes to serve as primers for the template-dependent, polymerase- 
5 mediated replication of a desired nucleic acid molecule. 

A wide variety of alternative cloning and in vitro amplification methodologies 
are well known to those skilled in the art. Examples of these techniques are found in, 
for example, Berger etal, Guide to Molecular Cloning Techniques, METHODS IN 
ENZYMOLOGY 152 Academic Press, San Diego, CA, which is incorporated herein by 
1 0 reference in its entirety. 

The nucleic acid molecules of the present invention, and fragments derived 
therefrom, are useful for screening for restriction fragment length polymorphism 
(RFLP) associated with certain disorders, as well as for genetic mapping. 

IS Antibodies 

Also comprehended by the present invention are antibodies (e.g., monoclonal 
and polyclonal antibodies, single chain antibodies, chimeric antibodies, 
bifiinctional/bispecific antibodies, humanized antibodies, human antibodies, and 
complementary determining region (CDR)-grafted antibodies, including compounds 

20 which include CDR sequences which specifically recognize a polypeptide of the 

invention) specific for NgR or fragments thereof. Preferred antibodies of the invention 
are human antibodies which are produced and identified according to methods 
described in W093/1 1236, published June 20, 1993, which is incorporated herein by 
reference in its entirety. Antibody fragments, including Fab, Fab', F(ab'>2> and F v , are 

25 also provided by the invention. The term "specific for," when used to describe 

antibodies of the invention, indicates that the variable regions of the antibodies of the 
invention recognize and bind NgR polypeptides exclusively (i.e., are able to distinguish 
NgR polypeptides from other known NgR polypeptides by virtue of measurable 
differences in binding affinity, despite the possible existence of localized sequence 

30 identity, homology, or similarity between NgR and such polypeptides). 

The antigenic peptide of NgR comprises at least 8 amino acid residues of the 
amino acid sequence shown in SEQ ID NO:2, 4 or 14 and encompasses an epitope of 
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NgR such that an antibody raised against the peptide forms a specific immune complex 
withNgR. Preferably, the antigenic peptide comprises at least 10 amino acid residues, 
more preferably at least 15 amino acid residues, even more preferably at least 20 amino 
acid residues, and most preferably at least 30 amino acid residues. Preferred epitopes 
5 encompassed by the antigenic peptide are regions of NgR that are located on the 
surface of the protein, e.g., hydrophilic regions. 

It will be understood that specific antibodies may also interact with other 
proteins (for example, S. aureus protein A or other antibodies in ELIS A techniques) 
through interactions with sequences outside the variable region of the antibodies, and, 

10 in particular, in the constant region of the molecule. Screening assays to determine 
binding specificity of an antibody of the invention are well known and routinely 
practiced in the art. For a comprehensive discussion of such assays, see Harlow et al. 
in Antibodies: A Laboratory Manual, Cold Spring Harbor Laboratory Press; Cold 
Spring Harbor, NY (1988), Chapter 6. Antibodies that recognize and bind fragments 

15 of the NgR polypeptides of the invention are also contemplated, provided that the 
antibodies are specific for NgR polypeptides. Antibodies of the invention can be 
produced using any method well known and routinely practiced in the art. 

For the production of polyclonal antibodies, various suitable host animals (e.g., 
rabbit, goat, mouse or other mammal) may be immunized by injection with the native 

20 protein, or a synthetic variant thereof, or a derivative of the foregoing. An appropriate 
immunogenic preparation can contain, for example, recombinantly expressed NgR 
protein or a chemically synthesized NgR polypeptide. The preparation can further 
include an adjuvant. Various adjuvants used to increase the immunological response 
include, but are not limited to, Freund's (complete and incomplete), mineral gels (e.g., 

25 aluminum hydroxide), surface active substances (e.g. , lysolecithin, pluronic polyols, 
polyanions, peptides, oil emulsions, dinitrophenol, etc.), human adjuvants such as 
Bacille Calmette-Guerin and Corynebacterium parvum or similar immunostimulatory 
agents. If desired, the antibody molecules directed against NgR can be isolated from 
the mammal (e.g., from the blood) and further purified by well known techniques, such 

30 as protein A chromatography to obtain the IgG fraction. 

The term "monoclonal antibody" or "monoclonal antibody composition," as 
used herein, refers to a population of antibody molecules that contain only one species 
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of an antigen binding site capable of immunoreacting with a particular epitope of NgR. 
A monoclonal antibody composition thus typically displays a single binding affinity for 
a particular NgR protein with which it immunoreacts. For preparation of monoclonal 
antibodies directed towards a particular NgR protein, or derivatives, fragments, 
5 analogs or homologs thereof any technique that provides for the production of 
antibody molecules by continuous cell line culture may be utilized. Such techniques 
include, but are not limited to, the hybridoma technique (see Kohler and Milstein 
(1975) Nature 256, 495-497); the trioma technique; the human B-cell hybridoma 
technique (see Kozbor et al, (1983) Immunol. Today 4, 72) and the EBV hybridoma 

10 technique to produce human monoclonal antibodies (see Cole et al, (1985) in 
Monoclonal Antibodies and Cancer Therapy, Alan R. Liss, Inc., pp. 77-96). 
Human monoclonal antibodies may be utilized in the practice of the present invention 
and may be produced by using human hybridomas (see Cote et al, (1983) Proc. Natl 
Acad Sci. USA 80, 2026-2030) or by transforming human B-cells with Epstein Ban* 

15 Virus in vitro (see Cole et a/., (1985), above). 

According to the invention, techniques can be adapted for the production of 
single-chain antibodies specific to a NgR protein (see e.g., U.S. Patent No! 4,946,778). 
In addition, methods can be adapted for the construction of Fab expression libraries 
(see e.g., Huse et al, (1989) Science 246, 1275-1281) to allow rapid and effective 

20 identification of monoclonal F^ fragments with the desired specificity for a NgR 

protein or derivatives, fragments, analogs or homologs thereof. Non-human antibodies 
can be "humanized" by techniques well known in the art. See e.g., U.S. Patent No. 
5,225,539. In one method, the non-human CDRs are inserted into a human antibody 
or consensus antibody framework sequence. Further changes can then be introduced 

25 into the antibody framework to modulate affinity or immunogenicity. Antibody 

fragments that contain the idiotypes to a NgR protein may be produced by techniques 
known in the art including, but not limited to: (i) an F(ab f ) 2 fragment produced by 
pepsin digestion of an antibody molecule; (ii) an Fab fragment generated by reducing 
the disulfide bridges of an F(ab f ) 2 fragment; (iii) an Fab fragment generated by the 

30 treatment of the antibody molecule with papain and a reducing agent and (iv) F v 
fragments. 
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Additionally, recombinant anti-NgR antibodies, such as chimeric and 
humanized monoclonal antibodies, comprising both human and non-human portions, 
which can be made using standard recombinant DNA techniques, are within the scope 
of the invention. Such chimeric and humanized monoclonal antibodies can be 
5 produced by recombinant DNA techniques known in the art, for example using 

methods described in PCT International Application No. PCT/US86/02269; European 
Patent Application No. 184,187; European Patent Application No. 171,496; European 
Patent Application No. 173,494; PCT International Publication No. WO 86/01533; 
U.S. Pat. No. 4,816,567; European Patent Application No. 125,023; Better et al, 

10 (1988) Science 240, 1041-1043; Liu etal, (1987) Proc. Natl Acad Set USA 84, 
3439-3443; Lin etal, (1987) J. Immunol 139, 3521-3526; Sun etal, (1987) Proc. 
Natl Acad Sci. USA 84, 214-218; Nishimura et al, (1987) Cancer Res. 47, 999-1005; 
Wood etal, (1985) Nature 314, 446-449; Shawefa/,. (1988) J. Natl Cancerlnst 
80, 1553-1559); Morrison (1985) Science 229, 1202-1207; Oietal, (1986) 

15 BioTechniques 4, 214; U.S. Patent. No. 5,225,539; Jones et al, (1986) Nature 321, 
552-525; Verhoeyan et al, (1988) Science 239, 1534; and Beidler et al, (1988) 1 
Immunol 141, 4053-4060. 

In a preferred embodiment of the invention a portion of a NgR is joined to an 
Fc portion of an antibody to form a NgR/Fc fusion protein. Preferably, the Ig fusion 

20 protein is soluble. The NgR/Fc fusion protein may be formed by recombinant 

techniques as described above. In one embodiment, a portion of a NgR including the 
entire amino acid sequence of NgR except the C-terminal hydrophobic region is fused 
to an Fc portion of an antibody. In preferred embodiments, the NgR is a human NgR 
and the Fc is also human. More preferably, the human Fc portion is derived from an 

25 IgG antibody. In other embodiments, the N-terminal signal sequence is omitted. Such 
antibodies are useful in binding Nogo to prevent Nogo signaling through the NgR. 

In one embodiment, methods for the screening of antibodies that possess the 
desired specificity include, but are not limited to, enzyme-linked immunosorbent assay 
(ELISA) and other immunologically-mediated techniques known within the art. In a 

30 specific embodiment, selection of antibodies that are specific to a particular domain of 
a NgR protein is facilitated by generation of hybridomas that bind to the fragment of a 
NgR protein possessing such a domain. Antibodies that are specific for one or more 
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domains within a NgR protein, e.g., domains spanning the above-identified conserved 
regions of NgRs, or derivatives, fragments analogs or homologs thereof, are also 
provided herein. 

Anti-NgR antibodies may be used in methods known within the art relating to 
5 the localization and/or quantitation of a NgR protein (e.g. , for use in measuring levels 
of the NgR protein within appropriate physiological samples, for use in diagnostic 
methods, for use in imaging the protein, and the like). In a given embodiment, 
antibodies for NgR proteins, or derivatives, fragments analogs or homologs thereof, 
that contain the antibody derived binding domain, are utilized as pharmacologically- 

10 active compounds [hereinafter "Therapeutics"]. 

An anti-NgR antibody (e.g., monoclonal antibody) can be used to isolate NgR 
by standard techniques, such as affinity chromatography or immunoprecipitation. An 
anti-NgR antibody can facilitate the purification of natural NgR from cells and of 
recombinantly produced NgR expressed in host cells. Moreover, an anti-NgR antibody 

15 can be used to detect NgR protein (e.g., in a cellular lysate or cell supernatant) in order 
to evaluate the abundance and pattern of expression of the NgR protein. Anti-NgR 
antibodies can be used diagnostically to monitor protein levels in tissue as part of a 
clinical testing procedure, e.g., to, for example, determine the efficacy of a given 
treatment regimen. Detection can be facilitated by coupling (i.e., physically linking) 

20 the antibody to a detectable substance. Examples of detectable substances include 
various enzymes, prosthetic groups, fluorescent materials, luminescent materials, 
bioluminescent materials and radioactive materials. Examples of suitable enzymes 
include horseradish peroxidase, alkaline phosphatase, p-galactosidase, or 
acetylcholinesterase; examples of suitable prosthetic group complexes include 

25 streptavidin/biotin and avidin/biotin; examples of suitable fluorescent materials include 
umbelliferone, fluorescein, fluorescein isothiocyanate, rhodamine, 
dichlorotriazinylamine fluorescein, dansyl chloride or phycoerythrin; an example of a 
luminescent material includes luminol; examples of bioluminescent materials include 
luciferase, luciferin and aequorin, and examples of suitable radioactive material include 

30 "V'V'Sor 3 * 

Another aspect of the present invention is directed to methods of inducing an 
immune response in a mammal against a polypeptide of the invention by administering 
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to the mammal an amount of the polypeptide sufficient to induce an immune response. 
The amount will be dependent on the animal species, size of the animal, and the like 
but can be determined by those skilled in the art. 

Another aspect of the invention is directed to anti-idiotypic antibodies and 
5 anti-anti-idiotypic antibodies. An anti-idiotypic antibody is an antibody that recognizes 
determinants of another antibody (a target antibody). Generally, the anti-idiotypic 
antibody recognizes determinants of the antigen-binding site of the target antibody. 
Typically, the target antibody is a monoclonal antibody. An anti-idiotypic antibody is 
generally prepared by immunizing an animal (particularly, mice) of the same species 

10 and genetic type as the source of the target monoclonal antibody, with the target 
monoclonal antibody. The immunized animal mounts an immune response to the 
idiotypic determinants of the target monoclonal antibody and produces antibodies 
against the idiotypic determinants of the target monoclonal antibody. 
Antibody-producing cells, such as splenic cells, of the immunized animal may be used 

15 to generate anti-idiotypic monoclonal antibodies. Furthermore, an anti-idiotypic 
antibody may also be used to immunize animals to produce anti-anti-idiotypic 
antibodies. These immunized animals may be used to generate anti-anti-idiotypic 
monoclonal antibodies using standard techniques. The anti-anti-idiotypic antibodies 
may bind to the same epitope as the original, target monoclonal antibody used to 

20 prepare the anti-idiotypic antibody. The anti-anti-idiotypic antibodies represent other 
monoclonal antibodies with the same antigen specificity as the original target 
monoclonal antibody. 

If the binding of the anti-idiotypic antibody with the target antibody is inhibited 
by the relevant antigen of the target antibody, and if the anti-idiotypic antibody induces 

25 an antibody response with the same specificity as the target antibody, it mimics the 
antigen of the target antibody. Such an anti-idiotypic antibody is an "internal image 
anti-idiotype" and is capable of inducing an antibody response as if it were the original 
antigen. (Bona and Kohler (1984) ANTI-IDIOTYPIC ANTIBODIES AND INTERNAL MAGE, 
in Monoclonal and anti-idiotypic antibodies: Probes for receptor structure 

30 AND FUNCTION, Venter J.C. et ah (Eds), Alan R. Liss, New York, NY, pp 141-149, 
1984). Vaccines incorporating internal image anti-idiotype antibodies have been 
shown to induce protective responses against viruses, bacteria, and parasites (Kennedy 
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etaU (1986)232, 220-223; 1047;McNamarae/*/., (\9%5) Science 226, 1325-1326). 
Internal image anti-idiotypic antibodies have also been shown to induce immunity to 
tumor related antigens (Raychauhuri et al, (1986) J. Immunol 137, 1743-1749; 
Raychauhurieftf/., (1987) J. Immunol 139, 3902-3910; Bhattacharya-Chatterjee 
5 al, (1987) J. Immunol 139, 1354-1360; Bhattacharya-Chatterjee et aL, (1988) J. 
Immunol 141, 1398-1403; Herlya etal (\9%9) Intern Rev. Immunol 4, 347-357; 
Chen et al (1990) Celllmm. Immunother. Cancer 351-359; Herlyn et al, (1991) in 
vivo 5, 615-624; Furuya et al, (1992) Anticancer Res. 12, 27-32; Mittelman, A. et al 
(1992) Proc. Natl Acad Sci. t USA 89, 466-470; Durrant etal, (1994) Cancer Res. 
10 54, 4837-4840; Mittelman. et al (1994) Cancer Res. 54, 415-421; Schmitt. et al 

(1994) Hybridoma 13, 389-396; Chakrobarty. etal (1995) J. Immunother. 18, 
95-103; Chakrobarty. etal. (1995) Cancer Res. 55, 1525-1530; Foon, K. A. etal 

(1995) Clin. Cancer Res. 1, 1205-1294; Herlyn et al (1995) Hybridoma 14, 159-166; 
Sclebuschefa/. (1995) Hybridoma 14, 167-174; Herlyn. etal (1996) Cancer 

15 Immunol Immunother. 43, 65-76). 

Anti-idiotypic antibodies for NgR may be prepared, for example, by 
immunizing an animal, such as a mouse, with a immunogenic amount of a composition 
comprising NgR2 (SEQ ID NO:2), NgR3 (SEQ ID NOs:4 or 14), or immunogenic 
portion thereof, containing at least one antigenic epitope of NgR. The composition 

20 may also contain a suitable adjuvant, and any carrier necessary to provide 

immunogenicity. Monoclonal antibodies recognizing NgR may be prepared from the 
cells of the immunized animal as described above. A monoclonal antibody recognizing 
an epitope of NgR is then selected and used to prepare a composition comprising an 
immunogenic amount of the anti-NgR monoclonal antibody. Typically, a 25 to 200 jig 

25 dose of purified anti-NgR monoclonal would be sufficient in a suitable adjuvant. 

Animals may be immunized 2-6 times at 14 to 30 day intervals between doses. 
Typically, animals are immunized by any suitable route of administration, such as 
intraperitoneal, subcutaneous, intravenous or a combination of these. Anti-idiotypic 
antibody production may be monitored during the immunization period using standard 

30 immunoassay methods. Animals with suitable titers of antibodies reactive with the 

target monoclonal antibodies may be reimmunized with the monoclonal antibody used 
as the immunogen three days before harvesting the antibody producing cells. 
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Preferably, spleen cells are used, although other antibody producing cells may be 
selected. Antibody-producing cells are harvested and fused with myeloma cells to 
produce Hybridomas, as described above, and suitable anti-idiotypic 
antibody-producing cells are selected. 
5 Anti-anti-idiotypic antibodies are produced by another round of immunization 

and Hybridoma production by using the anti-idiotypic monoclonal antibody as the 
immunogen. 

Antibodies of the invention are useful for, e.g., therapeutic purposes (by 
modulating activity of NgR), diagnostic purposes to detect or quantitate NgR, and 
10 purification of NgR. Therefore, kits comprising an antibody of the invention for any of 
the purposes described herein are also comprehended. 

Kits 

The present invention is also directed to kits, including pharmaceutical kits. 

15 The kits can comprise any of the nucleic acid molecules described above, any of the 
polypeptides described above, or any antibody which binds to a polypeptide of the 
invention as described above, as well appropriate controls, such as positive and/or 
negative controls. The kit preferably comprises additional components, such as, for 
example, instructions, solid support, reagents helpful for quantification, and the like. 

20 For example, the kit can comprise: a labeled compound or agent capable of detecting 
NgR protein or mRNA in a biological sample; means for determining the amount of 
NgR in the sample; and means for comparing the amount of NgR in the sample with a 
standard. The compound or agent can be packaged in a suitable container. 

25 Screening Assays 

The DNA and amino acid sequence information provided by the present 
invention also makes possible identification of binding partner compounds with which 
a NgR polypeptide or polynucleotide will interact. Methods to identify binding partner 
compounds include solution assays, in vitro assays wherein NgR polypeptides are 

30 immobilized and cell-based assays. Identification of binding partner compounds of 
NgR polypeptides provides candidates for therapeutic or prophylactic intervention in 
pathologies associated with NgR normal and aberrant biological activity. 
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The invention also provides a method (also referred to herein as a "screening 
assay") for identifying modulators, i.e., candidate or test compounds or agents (e.g., 
peptides, peptidomimetics, small molecules (e.g., molecules of less than 1,000 Daltons) 
or other drugs) that bind to NgR proteins or have a stimulatory- or inhibitory effect on, 
5 for example, NgR expression or NgR activity. 

In one embodiment, the invention provides assays for screening candidate or 
test compounds which bind to or modulate the activity of a NgR protein or 
polypeptide or biologically active portion thereof The test compounds of the present 
invention can be obtained using any of the numerous approaches in combinatorial 

10 library methods known in the art, including: biological libraries; spatially addressable 
parallel solid phase or solution phase libraries; synthetic library methods requiring 
deconvolution; the "one-bead one-compound" library method; and synthetic library 
methods using affinity chromatography selection. The biological library approach is 
limited to peptide libraries, while the other four approaches are applicable to peptide, 

15 non-peptide oligomer or small molecule libraries of compounds (Lam (1997) 
Anticancer Drug Des. 12, 145). 

Examples of methods for the synthesis of molecular libraries can be found in 
the art, for example in: DeWitt et al, (1993) Proa Natl. Acad Sci. USA 90, 6909; Erb 
etal, (1994) Proa Natl. Acad. Sci. USA 91,11422; Zuckermann et al (1994) J. Med 

20 Chem 37, 2678; Cho et al., (1993) Science 261, 1303; Carrell et al, (1994) Angew 
Chem. Int. Ed Engl. 33, 2059; Carell et al, (1994) Angew Chem. Int. Ed. Engl. 33, 
2061; and Gallop et al, (1994) J. Med Chem 37, 1233. 

Libraries of compounds may be presented in solution {e.g., Houghten (1992) 
BioTechniques 13, 412-421), or on beads (Lam (1991) Nature 354, 82-84), on chips 

25 (Fodor (1993) Nature 364, 555-556), bacteria (Ladner, U.S. Patent No. 5,223,409), 
spores (Ladner, above), plasmids (Cull etal. (1992) Proc. Natl. Acad. Sci. USA 89, 
1865-1869) or on phage (Scott and Smith (1990) Science 249, 386-390; Devlin (1990) 
Science 249, 404-406; Cwirla etal. (1990) Proc. Natl. Acad Sci. USA 87, 6378-6382; 
Felici (1991) X Mol. Biol. 222, 301-310; Ladner, above). 



30 
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1. Cell-based Assays 

The invention also provides cell-based assays to identify binding partner 
compounds of a NgR polypeptide. In one embodiment, the invention provides a 
method comprising the steps of contacting a NgR polypeptide expressed on the surface 
5 of a cell with a candidate binding partner compound and detecting binding of the 

candidate binding partner compound to the NgR polypeptide. In another embodiment, 
an assay is a cell-based assay comprising contacting a cell expressing a membrane- 
bound form of NgR protein, or a biologically active portion thereof, on the cell surface 
with a test compound and determining the ability of the test compound to modulate 
10 {e.g., stimulate or inhibit) the activity of the NgR protein or biologically active portion 
thereof. 

In one embodiment, an assay is a cell-based assay in which a cell which 
expresses a membrane-bound form of NgR protein, or a biologically active portion 
thereof, on the cell surface is contacted with a test compound and the ability of the test 

1 5 compound to bind to a NgR protein determined. The cell, for example, can be of 

mammalian origin or a yeast cell. Determining the ability of the test compound to bind 
to the NgR protein can be accomplished, for example, by coupling the test compound 
with a radioisotope or enzymatic label such that binding of the test compound to the 
NgR protein or biologically active portion thereof can be determined by detecting the 

20 labeled compound in a complex. For example, test compounds can be labeled with 
125 I, 35 S, 14 C, or 3 H, either directly or indirectly, and the radioisotope detected by 
direct counting of radioemission or by scintillation counting. Alternatively, test 
compounds can be enzymatically labeled with, for example, horseradish peroxidase, 
alkaline phosphatase or luciferase, and the enzymatic label detected by determination 

25 of conversion of an appropriate substrate to product. In one embodiment, the assay 
comprises contacting a cell which expresses a membrane-bound form of NgR protein 
or a biologically active portion thereof; on the cell surface with a known compound 
which binds NgR to form an assay mixture, contacting the assay mixture with a test 
compound, and determining the ability of the test compound to interact with a NgR 

30 protein, wherein determining the ability of the test compound to interact with a NgR 
protein comprises determining the ability of the test compound to preferentially bind to 
NgR or a biologically active portion thereof as compared to the known compound. 
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Determining the ability of the test compound to modulate the activity of NgR 
or a biologically active portion thereof can be accomplished, for example, by 
determining the ability of the NgR protein to bind to or interact with a NgR target 
molecule. As used herein, a "target molecule" is a molecule with which a NgR protein 
5 binds or interacts in nature, for example, a molecule on the surface of a cell which 
expresses a NgR protein, a molecule on the surface of a second cell, a molecule in the 
extracellular milieu, a molecule associated with the internal surface of a cell membrane 
or a cytoplasmic molecule. A NgR target molecule can be a non-NgR molecule or a 
NgR protein or polypeptide of the present invention. In one embodiment, a NgR 

10 target molecule is a component of a signal transduction pathway that facilitates 
transduction of an extracellular signal {e.g., a signal generated by binding of a 
compound to a membrane-bound NgR molecule) through the cell membrane and into 
the cell. The target, for example, can be a second intercellular protein that has 
catalytic activity or a protein that facilitates the association of downstream signaling 

15 molecules with NgR. In a preferred embodiment, the detection comprises detecting a 
calcium flux or other physiological event in the cell caused by the binding of the 
molecule. 

Specific binding molecules, including natural ligands and synthetic compounds, 
can be identified or developed using isolated or recombinant NgR products, NgR 
20 variants, or preferably, cells expressing such products. Binding partners are useful for 
purifying NgR products and detection or quantification of NgR products in fluid and 
tissue samples using known immunological procedures. Binding molecules are also 
manifestly useful in modulating (i.e., blocking, inhibiting or stimulating) biological 
activities of NgR, especially those activities involved in signal transduction. 

25 

2. Cell-free Assays 

(a) Direct binding: 

The invention includes several assay systems for identifying NgR binding 
partners. In solution assays, methods of the invention comprise the steps of (a) 
30 contacting a NgR polypeptide with one or more candidate binding partner compounds 
and (b) identifying the compounds that bind to the NgR polypeptide. Identification of 
the compounds that bind the NgR polypeptide can be achieved by isolating the NgR 
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polypeptide/binding partner complex and separating the binding partner compound 
from the NgR polypeptide. An additional step of characterizing the physical, biological 
and/or biochemical properties of the binding partner compound is also comprehended 
in another embodiment of the invention. In one aspect, the NgR polypeptide/binding 
5 partner complex is isolated using an antibody immunospecific for either the NgR 
polypeptide or the candidate binding partner compound. 

In still other embodiments, either the NgR polypeptide or the candidate binding 
partner compound comprises a label or tag that facilitates its isolation, and methods of 
the invention to identify binding partner compounds include a step of isolating the NgR 
1 0 polypeptide/binding partner complex through interaction with the label or tag. An 
exemplary tag of this type is a poly-histidine sequence, generally around six histidine 
residues, that permits isolation of a compound so labeled using nickel chelation. Other 
labels and tags, such as the FLAG® tag (Eastman Kodak, Rochester, NY), well known 
and routinely used in the art, are embraced by the invention. 

15 

(b) Immobilized NgR 

In one variation of an in vitro assay, the invention provides a method 
comprising the steps of (a) contacting an immobilized NgR polypeptide, or a 
biologically active fragment thereof with a candidate binding partner compound and (b) 

20 detecting binding of the candidate compound to the NgR polypeptide. In an 

alternative embodiment, the candidate binding partner compound is immobilized and 
, binding of NgR is detected. Immobilization is accomplished using any of the methods 
well known in the art, including covalent bonding to a support, a bead or a 
chromatographic resin, as well as non-covalent, high affinity interactions such as 

25 antibody binding, or use of streptavidin/biotin binding wherein the immobilized 
compound includes a biotin moiety. Binding of a test compound to NgR, or 
interaction of NgR with a target molecule in the presence and absence of a candidate 
compound, can be accomplished in any vessel suitable for containing the reactants. 
Examples of such vessels include microtiter plates, test tubes, and micro-centrifuge 

30 tubes. In one embodiment, a fusion protein can be provided that adds a domain that 
allows one or both of the proteins to be bound to a matrix. For example, and not by 
way of limitation, GST-NgR fusion proteins or GST-target fusion proteins can be 
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adsorbed onto glutathione sepharose beads (Sigma Chemical, St. Louis, MO) or 
glutathione derivatized microtiter plates, that are then combined with the test 
compound or the test compound and either the non-adsorbed target protein or NgR 
protein, and the mixture is incubated under conditions conducive to complex formation 
5 (e.g., at physiological conditions for salt and pH). Following incubation, the beads or 
microtiter plate wells are washed to remove any unbound components, the matrix 
immobilized in the case of beads, and the complexes determined either direcdy or 
indirectly, for example, as described above. Alternatively, the complexes can be 
dissociated from the matrix, and the level of NgR binding or activity determined using 

10 standard techniques. 

Other techniques for immobilizing proteins on matrices can also be used in the 
screening assays of the invention. For example, either NgR or its target molecule can 
be immobilized utilizing conjugation of biotin and streptavidin. Biotinylated NgR or 
target molecules can be prepared from biotin-NHS (N-hydroxy- succinimide) using 

15 techniques well known in the art (e.g., biotinylation kit, Pierce Chemicals, Rockford, 
IL), and immobilized in the wells of streptavidin-coated 96 well plates (Pierce 
Chemical). Alternatively, antibodies reactive with NgR or target molecules, but which 
do not interfere with binding of the NgR protein to its target molecule, can be 
derivatized to the wells of the plate, and unbound target or NgR trapped in the wells 

20 by antibody conjugation. Methods for detecting such complexes, in addition to those 
described above for the GST-immobilized complexes, include immunodetection of 
complexes using antibodies reactive with the NgR or target molecule, as well as 
enzyme-linked assays that rely on detecting an enzymatic activity associated with the 
NgR or target molecule. 

25 Detection of binding can be accomplished (i) using a radioactive label on the 

compound that is not immobilized, (ii) using of a fluorescent label on the 
non-immobilized compound, (iii) using an antibody immunospecific for the 
non-immobilized compound, (iv) using a label on the non-immobilized compound that 
excites a fluorescent support to which the immobilized compound is attached, (v) 

30 determining the activity of the NgR, as well as other techniques well known and 
routinely practiced in the art. 
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Determining the activity of the target molecule, for example, may be 
accomplished by detecting induction of a cellular second messenger of the target (/.e. 
intracellular Ca 2+ , diacylglycerol, IP3, etc), detecting catalytic/enzymatic activity of the 
target an appropriate substrate, detecting the induction of a reporter gene (comprising 
5 a NgR-responsive regulatory element operatively linked to a nucleic acid encoding a 
detectable marker, e.g., luciferase), or detecting a cellular response, for example, cell 
survival, cellular differentiation, or cell proliferation. 

(c) Competition experiments 

10 In yet another embodiment, the assay comprises contacting the NgR protein or 

biologically active portion thereof with a known compound which binds NgR to form 
an assay mixture, contacting the assay mixture with a test compound, and determining 
the ability of the test compound to interact with a NgR protein, wherein determining 
the ability of the test compound to interact with a NgR protein comprises determining 
the ability of the test compound to preferentially bind to NgR or biologically active 
portion thereof as compared to the known compound. 

In yet another embodiment, the cell-free assay comprises contacting the NgR 
protein or biologically active portion thereof with a known compound which binds 
NgR to form an assay mixture, contacting the assay mixture with a test compound, and 
determining the ability of the test compound to interact with a NgR protein, wherein 
determining the ability of the test compound to interact with a NgR protein comprises 
determining the ability of the NgR protein to modulate the activity of a NgR target 
molecule. 

The cell-free assays of the present invention are amenable to use of both the 
soluble form or the membrane-bound form of NgR. In the case of cell-free assays 
comprising the membrane-bound form of NgR, it may be desirable to utilize a 
solubilizing agent such that the membrane-bound form of NgR is maintained in 
solution. Examples of such solubilizing agents include non-ionic detergents such as n- 
octylglucoside, n-dodecylglucoside, n-dodecylmaltoside, octanoyl-N-methylglucamide, 
decanoyl-N-methylglucamide, Triton® X-100, Triton® X-114, Thesit®, 
Isotridecypoly(ethylene glycol ether) n , 3-(3-cholamidopropyl)dimethylamminiol-l- 
propane sulfonate (CHAPS), 3-(3-cholamidopropyl)dimethylamminiol-2-hydroxy-l- 
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propane sulfonate (CHAPSO), or N-dodecyl-N,N-dimethyl-3 -ammonio- 1 -propane 
sulfonate. 

Modulators 

5 Agents that modulate {i.e., increase, decrease, or block) NgR activity or 

expression may be identified by incubating a putative modulator with a cell containing 
a NgR polypeptide or polynucleotide and determining the effect of the putative 
modulator on NgR activity or expression. The selectivity of a compound that 
modulates the activity of NgR can be evaluated by comparing its effects on NgR to its 

10 effect on other NgR compounds. Selective modulators may include, for example, 

antibodies and other proteins, peptides or organic molecules which specifically bind to 
a NgR polypeptide or a NgR-encoding nucleic acid. Modulators of NgR activity will 
be therapeutically useful in treatment of diseases and physiological conditions in which 
normal or aberrant NgR activity is involved. NgR polynucleotides, polypeptides and 

15 modulators may be used in the treatment of such diseases and conditions associated 
with demyelination. NgR polynucleotides and polypeptides, as well as NgR 
modulators, may also be used in diagnostic assays for such diseases or conditions. 

Methods of the invention to identify modulators include variations on any of 
the methods described above to identify binding partner compounds, the variations 

20 including techniques wherein a binding partner compound has been identified and the 
binding assay is carried out in the presence and absence of a candidate modulator. A 
modulator is identified in those instances where binding between the NgR polypeptide 
and the binding partner compound changes in the presence of the candidate modulator 
compared to binding in the absence of the candidate modulator compound. A 

25 modulator that increases binding between the NgR polypeptide and the binding partner 
compound is described as an enhancer or activator, and a modulator that decreases 
binding between the NgR polypeptide and the binding partner compound is described 
as an inhibitor. 

In another embodiment, modulators of NgR expression may be identified in a 
30 method wherein a cell is contacted with a candidate compound and the expression of 
NgR mRNA or protein in the cell is determined. The level of expression of NgR 
mRNA or protein in the presence of the candidate compound is compared to the level 
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of expression of NgR mRNA or protein in the absence of the candidate compound. 
The candidate compound can then be identified as a modulator of NgR expression 
based on this comparison. For example, when expression of NgR mRNA or protein is 
greater (statistically significantly greater) in the presence of the candidate compound 
5 than in its absence, the candidate compound is identified as a stimulator of NgR mRNA 
or protein expression. Alternatively, when expression of NgR mRNA or protein is less 
(statistically significantly less) in the presence of the candidate compound than in its 
absence, the candidate compound is identified as an inhibitor of NgR mRNA or protein 
expression. The level of NgR mRNA or protein expression in the cells can be 
10 determined by methods described herein for detecting NgR mRNA or protein. 

High Throughput Screening 

The invention also comprehends high-throughput screening (HTS) assays to 
identify compounds that interact with or inhibit biological activity (i.e. , affect 

15 enzymatic activity, binding activity, etc.) of a NgR polypeptide. HTS assays permit 
screening of large numbers of compounds in an efficient manner. Cell-based HTS 
systems are contemplated to investigate NgR receptor-ligand interaction. HTS assays 
are designed to identify "hits" or "lead compounds' 1 having the desired property, from 
which modifications can be designed to improve the desired property. Chemical 

20 modification of the "hit" or "lead compound" is often based on an identifiable 
structure/activity relationship between the "hit" and the NgR polypeptide. 

Another aspect of the present invention is directed to methods of identifying 
compounds that bind to either NgR or nucleic acid molecules encoding NgR, 
comprising contacting NgR, or a nucleic acid molecule encoding the same, with a 

25 compound, and determining whether the compound binds NgR or a nucleic acid 

molecule encoding the same. Binding can be determined by binding assays which are 
well known to the skilled artisan, including, but not limited to, gel-shift assays, 
Western blots, radiolabeled competition assay, phage-based expression cloning, 
co-fractionation by chromatography, co-precipitation, cross linking, interaction 

30 trap/two-hybrid analysis, southwestern analysis, ELISA, and the like, which are 

described in, for example, Ausubel eial (Eds.),CURRENT PROTOCOLS IN MOLECULAR 
BIOLOGY, 1999, John Wiley & Sons, NY, which is incorporated herein by reference in 
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its entirety. The NgR proteins, for example, can be used as "bait proteins" in a two- 
hybrid assay or three hybrid assay (see, e.g., U.S. Patent No. 5,283,3 17; Zervos et al., 
(1993) Cell 72, 223-232; Madura et al, (1993) J. Biol. Chem. 268, 12046-12054; 
Bartel et al, (1993) BioTechniques 14, 920-924; Iwabuchi et al, (1993) Oncogene 8, 
5 1693-1696; and Brent WO 94/10300), to identify other proteins that bind to or interact 
with NgR ("NgR-binding proteins" or "NgR-bp") and modulate NgR activity. Such 
Ngjt-binding proteins are also likely to be involved in the propagation of signals by the 
NgR proteins as, for example, upstream or downstream elements of the NgR pathway. 
Other assays may be used to identify specific ligands of a NgR receptor, 

10 including assays that identify ligands of the target protein through measuring direct 
binding of test ligands to the target protein, as well as assays that identify ligands of 
target proteins through affinity ultrafiltration with ion spray mass spectroscopy/HPLC 
methods or other physical and analytical methods. Alternatively, such binding 
interactions are evaluated indirectly using the yeast two-hybrid system described in 

15 Fields** a/., {\9%9) Nature 340, 245-246, and Fields etal, (1994) TrendsGenet. 10, 
286-292, both of which are incorporated herein by reference. The two-hybrid system 
is a genetic assay based on the modular nature of most transcription factors used for 
detecting interactions between two proteins or polypeptides. It can be used to identify 
proteins that bind to a known protein of interest, or to delineate domains or residues 

20 critical for an interaction. Variations on this methodology have been developed to 
clone genes that encode DNA binding proteins, to identify peptides that bind to a 
protein, and to screen for drugs. The two-hybrid system exploits the ability of a pair of 
interacting proteins to bring a transcription activation domain into close proximity with 
a DNA binding domain that binds to an upstream activation sequence (UAS) of a 

25 reporter gene, and is generally performed in yeast. The assay requires the construction 
of two hybrid genes encoding (1) a DNA-binding domain that is fused to a first protein 
and (2) an activation domain fused to a second protein. The DNA-binding domain 
targets the first hybrid protein to the UAS of the reporter gene; however, because most 
proteins lack an activation domain, this DNA-binding hybrid protein does not activate 

30 transcription of the reporter gene. The second hybrid protein, which contains the 

activation domain, cannot by itself activate expression of the reporter gene because it 
does not bind the UAS. However, when both hybrid proteins are present, the 
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noncovalent interaction of the first and second proteins tethers the activation domain 
to the UAS, activating transcription of the reporter gene. For example, when the first 
protein is a NgR gene product, or fragment thereof, that is known to interact with 
another protein or nucleic acid, this assay can be used to detect agents that interfere 
5 with the binding interaction. Expression of the reporter gene is monitored as different 
test agents are added to the system. The presence of an inhibitory agent results in lack 
of a reporter signal. The compounds to be screened include (which may include 
compounds that are suspected to bind NgR, or a nucleic acid molecule encoding the 
same), but are not limited to, extracellular, intracellular, biological or chemical origin. 

10 The function of the NgR gene product is unclear and no ligands have yet been 

found which bind the gene product. The yeast two-hybrid assay is useful to identify 
proteins that bind to the gene product. In an assay to identify proteins that bind to a 
NgR receptor, or fragment thereof, a fusion polynucleotide encoding both a NgR 
receptor (or fragment) and a UAS binding domain (i.e., a first protein) may be used. 

15 In addition, a large number of hybrid genes each encoding a different second protein 
fused to an activation domain are produced and screened in the assay. Typically, the 
second protein is encoded by one or more members of a total cDNA or genomic DNA 
fusion library, with each second protein-coding region being fused to the activation 
domain. This system is applicable to a wide variety of proteins, and it is not even 

20 necessary to know the identity or function of the second binding protein. The system 
. is highly sensitive and can detect interactions not revealed by other methods; even 
transient interactions may trigger transcription to produce a stable mRNA that can be 
repeatedly translated to yield the reporter protein. 

Other assays may be used to search for agents that bind to the target protein. 

25 One such screening method to identify direct binding of test ligands to a target protein 
is described in U.S. Patent No. 5,585,277, incorporated herein by reference. This 
method relies on the principle that proteins generally exist as a mixture of folded and 
unfolded states, and continually alternate between the two states. When a test ligand 
binds to the folded form of a target protein (i.e., when the test ligand is a ligand of the 

30 target protein), the target protein molecule bound by the ligand remains in its folded 
state. Thus, the folded target protein is present to a greater extent in the presence of a 
test ligand which binds the target protein, than in the absence of a ligand. Binding of 
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the ligand to the target protein can be determined by any method which distinguishes 
between the folded and unfolded states of the target protein. The function of the 
target protein need not be known in order for this assay to be performed. Virtually any 
agent can be assessed by this method as a test ligand, including, but not limited to, 
5 metals, polypeptides, proteins, lipids, polysaccharides, polynucleotides and small 
organic molecules. 

Another method for identifying ligands of a target protein is described in 
Wieboldt etal (1997) Anal Chem. 69:1683-1691, incorporated herein by reference. 
This technique screens combinatorial libraries of 20-30 agents at a time in solution 

10 phase for binding to the target protein. Agents that bind to the target protein are 
separated from other library components by simple membrane washing. The 
specifically selected molecules that are retained on the filter are subsequently liberated 
from the target protein and analyzed by HPLC and pneumatically assisted electrospray 
(ion spray) ionization mass spectroscopy. This procedure selects library components 

15 with the greatest affinity for the target protein, and is particularly useful for small 
molecule libraries. 

The methods of the invention also embrace ligands, especially neuropeptides, 
that are attached to a label, such as a radiolabel (e.g., 125 1, 35 S, 32 P, 33 P, 3 H), a 
fluorescence label, a chemiluminescent label, an enzymic label and an immunogenic 

20 label. Modulators falling within the scope of the invention include, but are not limited 
to, non-peptide molecules such as non-peptide mimetics, non-peptide allosteric 
effectors, and peptides. The NgR polypeptide or polynucleotide employed in such a 
test may either be free in solution, attached to a solid support, borne on a cell surface 
or located intracellularly or associated with a portion of a cell. One skilled in the art 

25 can, for example, measure the formation of complexes between NgR and the 
compound being tested. Alternatively, one skilled in the art can examine the 
diminution in complex formation between NgR and its substrate caused by the 
compound being tested. 

Another aspect of the present invention is directed to methods of identifying 

30 compounds which modulate (i.e. , increase or decrease) activity of NgR comprising 
contacting NgR with a compound, and determining whether the compound modifies 
activity of NgR. The activity in the presence of the test compared is measured to the 
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activity in the absence of the test compound. Where the activity of the sample 
containing the test compound is higher than the activity in the sample lacking the test 
compound, the compiound will have increased activity. Similarly, where the activity of 
the sample containing the test compound is lower than the activity in the sample 
5 lacking the test compound, the compound will have inhibited activity. 

The present invention is particularly useful for screening compounds by using 
NgR in any of a variety of drug screening techniques. The compounds to be screened 
include (which may include compounds which are suspected to modulate NgR 
activity), but are not limited to, extracellular, intracellular, biologic or chemical origin. 

10 The NgR polypeptide employed in such a test may be in any form, preferably, free in 
solution, attached to a solid support, borne on a cell surface or located intracellularly. 
One skilled in the art can, for example, measure the formation of complexes between 
NgR and the compound being tested. Alternatively, one skilled in the art can examine 
the diminution in complex formation between Nogo-R and its substrate caused by the 

1 5 compound being tested. 

The activity of NgR polypeptides of the invention can be determined by, for 
example, examining the ability to bind or be activated by chemically synthesized 
peptide ligands. Alternatively, the activity of the NgR can be assayed by examining 
their ability to bind calcium ions, hormones, chemokines, neuropeptides, 

20 neurotransmitters, nucleotides, lipids, odorants and photons. Alternatively, the activity 
of the NgR can be determined by examining the activity of effector molecules 
including, but not limited to, adenylate cyclase, phospholipases and ion channels. 
Thus, modulators of NgR activity may alter a NgR receptor function, such as a binding 
property of a receptor or an activity. In various embodiments of the method, the assay 

25 may take the form of an ion flux assay, a yeast growth assay, a non-hydrolyzable GTP 
assay such as a [ 35 S]-GTP S assay, a cAMP assay, an inositol triphosphate assay, a 
diacylglycerol assay, an Aequorin assay, a Luciferase assay, a FLDPR assay for 
intracellular Ca 2+ concentration, a mitogenesis assay, a MAP Kinase activity assay, an 
arachidonic acid release assay (e.g., using [ 3 H]-arachidonic acid) and an assay for 

30 extracellular acidification rates, as well as other binding or function-based assays of 
NgR activity that are generally known in the art. NgR activity can be determined by 
methodologies that are used to assay for FaRP activity, which is well known to those 
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skilled in the art. Biological activities of NgR receptors according to the invention 
include, but are not limited to, the binding of a natural or an unnatural ligand, as well 
as any one of the functional activities of NgRs known in the art. Non-limiting 
examples of NgR activities include transmembrane signaling of various forms, which 
5 may involve phosphatidylinositol (PI) association and/or the exertion of an influence 
over PI; another exemplary activity of NgRs is the binding of accessory proteins or 
polypeptides that differ from known GPI proteins. 

The modulators of the invention exhibit a variety of chemical structures, which 
can be generally grouped into non-peptide mimetics of natural NgR receptor ligands, 

10 peptide and non-peptide allosteric effectors of NgR receptors, and peptides that may 
function as activators or inhibitors (competitive, uncompetitive and non-competitive) 
{e.g., antibody products) of NgR receptors. The invention does not restrict the 
sources for suitable modulators, which may be obtained from natural sources such as 
plant, animal or mineral extracts, or non-natural sources such as small molecule 

15 libraries, including the products of combinatorial chemical approaches to library 
construction, and peptide libraries. 

Other assays can be used to examine enzymatic activity including, but not 
limited to, photometric, radiometric, HPLC, electrochemical, and the like, which are 
described in, for example, Enzyme Assays : A Practical Approach, Eisenthal and 

20 Danson (Eds.), 1992, Oxford University Press, which is incorporated herein by 
reference in its entirety. 

The use of cDNAs in drug discovery programs is well-known; assays capable 
of testing thousands of unknown compounds per day in high-throughput screens 
(HTSs) are thoroughly documented. The literature is replete with examples of the use 

25 of radiolabelled ligands in HTS binding assays for drug discovery (see Williams (1 99 1) 
Med. Res. Rev., 11, 147-184; Sweetnam et al, (1993) J. Nat. Prod. 56, 441-455 for 
review). Recombinant receptors are preferred for binding assay HTS because they 
allow for better specificity (higher relative purity), provide the ability to generate large 
amounts of receptor material, and can be used in a broad variety of formats (see 

30 Hodgson (1992) Bio/Technology 10, 973-980; each of which is incorporated herein by 
reference in its entirety). 
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A variety of heterologous systems is available for functional expression of 
recombinant receptors that are well known to those skilled in the art. Such systems 
include bacteria (Strosberg etal (1992) Trends Pharmacol Set 13, 95-98), yeast 
(Pausch (1997) Trends Biotechnol 15, 487-494), several kinds of insect cells (Vanden 
5 Broeck (1996) Int Rev. Cytol 164, 189-268), amphibian cells (Jayawickreme et al 
(1997) Curr. Opin. Biotechnol. 8, 629-634) and several mammalian cell lines (CHO, 
HEK293, COS, etc.; see Gerhardt etal (1997) Eur. J. Pharmacol 334, 1-23). These 
examples do not preclude the use of other possible cell expression systems, including 
cell lines obtained from nematodes (PCT application WO 98/37177). 

10 In preferred embodiments of the invention, methods of screening for 

compounds which modulate NgR activity comprise contacting test compounds with 
NgR and assaying for the presence of a complex between the compound and NgR. In 
such assays, the ligand is typically labeled. After suitable incubation, free ligand is 
separated from that present in bound form, and the amount of free or uncomplexed 

15 label is a measure of the ability of the particular compound to bind to NgR. 

In another embodiment of the invention, high throughput screening for 
compounds having suitable binding affinity to NgR is employed. Briefly, large 
numbers of different small peptide test compounds are synthesized on a solid substrate. 
The peptide test compounds are contacted with NgR and washed. Bound NgR is then 

20 detected by methods well known in the art. Purified polypeptides of the invention can 
also be coated directly onto plates for use in the aforementioned drug screening 
techniques. In addition, non-neutralizing antibodies can be used to capture the protein 
and immobilize it on the solid support. 

Generally, an expressed NgR can be used for HTS binding assays in 

25 conjunction with its defined ligand. The identified peptide is labeled with a suitable 
radioisotope, including, but not limited to, 125 I, 3 H, 35 S or 32 P, by methods that are 
well known to those skilled in the art. Alternatively, the peptides may be labeled by 
well-known methods with a suitable fluorescent derivative (Baindur et al (1994) Drug 
Dev. Res. 33, 373-398; Rogers (1997) Drug Discov. Today 2, 156-160). Radioactive 

30 ligand specifically bound to the receptor in membrane preparations made from the cell 
line expressing the recombinant protein can be detected in HTS assays in one of 
several standard ways, including filtration of the receptor-ligand complex to separate 
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bound ligand from unbound ligand (Williams (1991) Med Res. Rev. 11, 147-184; 
Sweetnam et al (1993) J. Nat. Prod 56, 441-455). Alternative methods include a 
scintillation proximity assay (SPA) or a FlashPlate format in which such separation is 
unnecessary (Nakayama (1998) Curr. Opin. Drug Disc. Dev. 1, 85-91 Bosse et al. 
5 (1998) J. Biomol Screening 3, 285-292). Binding of fluorescent ligands can be 
detected in various ways, including fluorescence energy transfer (FRET), direct 
spectrophotofluorometric analysis of bound ligand, or fluorescence polarization 
(Rogers (1997) DrugDiscov. Today 2, 156-160; Hill (1998) Curr. Opin. Drug Disc. 
Dev. 1, 92-97). 

1 0 Examples of such biological responses include, but are not limited to, the 

following: the ability to survive in the absence of a limiting nutrient in specifically 
engineered yeast cells (Pausch (1997) Trends in Biotechnol 15, 487-494); changes in 
intracellular Ca 2+ concentration as measured by fluorescent dyes (Murphy et al. (1998) 
Cur. Opin. Drug Disc. Dev. 1, 192-199). Fluorescence changes can also be used to 

15 monitor ligand-induced changes in membrane potential or intracellular pH; an 

automated system suitable for HTS has been described for these purposes (Schroeder 
et al (1996) J. Biomol Screening 1, 75-80). Melanophores prepared from Xenopus 
laevis show a ligand-dependent change in pigment organization in response to 
heterologous NgR activation; this response is adaptable to HTS formats 

20 (Jayawickreme et al (1997) Curr. Opin. Biotechnol. 8, 629-634). Assays are also 
available for the measurement of common second messengers, including cAMP, 
phosphoinositides and arachidonic acid, but these are not generally preferred for HTS. 

Preferred methods of HTS employing these receptors include permanently 
transfected CHO cells, in which agonists and antagonists can be identified by the ability 

25 to transduce the signal for the binding of Nogo in membranes prepared from these cells 
through the putative GPI anchor. In another embodiment of the invention, 
permanently transfected CHO cells could be used for the preparation of membranes 
which contain significant amounts of the recombinant receptor proteins; these 
membrane preparations would then be used in receptor binding assays, employing the 

30 radiolabeled ligand specific for the particular receptor. Alternatively, a functional 
assay, such as fluorescent monitoring of ligand-induced changes in internal Ca 
concentration or membrane potential in permanently transfected CHO cells containing 
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each of these receptors individually or in combination would be preferred for HTS. 
Equally preferred would be an alternative type of mammalian cell, such as HEK293 or 
COS cells, in similar formats. More preferred would be permanently transfected insect 
cell lines, such as Drosophila S2 cells. Even more preferred would be recombinant 
5 yeast cells expressing the Drosophila melanogaster receptors in HTS formats well 
known to those skilled in the art (e.g., Pausch (1997), above). 

The invention contemplates a multitude of assays to screen and identify 
inhibitors of ligand binding to NgR receptors. In one example, the NgR receptor is 
immobilized and interaction with a binding partner is assessed in the presence and 

10 absence of a candidate modulator such as an inhibitor compound. In another example, 
interaction between the NgR receptor and its binding partner is assessed in a solution 
assay, both in the presence and absence of a candidate inhibitor compound. In either 
assay, an inhibitor is identified as a compound that decreases binding between the NgR 
receptor and its binding partner. Another contemplated assay involves a variation of 

15 the di-hybrid assay wherein an inhibitor of protein/protein interactions is identified by 
detection of a positive signal in a transformed or transfected host cell, as described in 
PCT publication number WO 95/20652, published August 3, 1995. 

Candidate modulators contemplated by the invention include compounds 
selected from libraries of either potential activators or potential inhibitors. There are a 

20 number of different libraries used for the identification of small molecule modulators, 
including: (1) chemical libraries, (2) natural product libraries, and (3) combinatorial 
libraries comprised of random peptides, oligonucleotides or organic molecules. 
Chemical libraries consist of random chemical structures, some of which are analogs of 
known compounds or analogs of compounds that have been identified as "hits" or 

25 "leads" in other drug discovery screens, some of which are derived from natural 
products, and some of which arise from non-directed synthetic organic chemistry. 
Natural product libraries are collections of microorganisms, animals, plants, or marine 
organisms that are used to create mixtures for screening by: (1) fermentation and 
extraction of broths from soil, plant or marine microorganisms or (2) extraction of 

30 plants or marine organisms. Natural product libraries include polyketides, 

non-ribosomal peptides, and variants (non-naturally occurring) thereof. For a review, 
see Cane et al, Science (1998) 282, 63-68. Combinatorial libraries are composed of 
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large numbers of peptides, oligonucleotides, or organic compounds as a mixture. 
These libraries are relatively easy to prepare by traditional automated synthesis 
methods, PCR, cloning, or proprietary synthetic methods. Of particular interest are 
non-peptide combinatorial libraries. Still other libraries of interest include peptide, 
5 protein, peptidomimetic, multiparallel synthetic collection, recombinatorial, and 
polypeptide libraries. For a review of combinatorial chemistry and libraries created 
therefrom, see Myers (1997) Curr. Opin. Biotechnol. 8, 701-707. Identification of 
modulators through use of the various libraries described herein permits modification 
of the candidate "hit" (or "lead") to optimize the capacity of the "hit" to modulate 
10 activity. 

Still other candidate inhibitors contemplated by the invention can be designed 
and include soluble forms of binding partners, as well as such binding partners as 
chimeric, or fusion, proteins. A "binding partner" as used herein broadly encompasses 
non-peptide modulators, as well as such peptide modulators as neuropeptides other 

15 than natural ligands, antibodies, antibody fragments, and modified compounds 

comprising antibody domains that are immunospecific for the expression product of the 
identified NgR gene. 

Other embodiments of the invention comprise using competitive screening 
assays in which neutralizing antibodies capable of binding a polypeptide of the 

20 invention specifically compete with a test compound for binding to the polypeptide. In 
this manner, the antibodies can be used to detect the presence of any peptide that 
shares one or more antigenic determinants with NgR. Radiolabeled competitive 
binding studies are described in Lin et aL, (1997) Antimicrob. Agents Chemother. 41, 
2127-213 1, the disclosure of which is incorporated herein by reference in its entirety. 

25 In other embodiments of the invention, the polypeptides of the invention are 

employed as a research tool for identification, characterization and purification of 
interacting, regulatory proteins. Appropriate labels are incorporated into the 
polypeptides of the invention by various methods known in the art and the 
polypeptides are used to capture interacting molecules. For example, molecules are 

30 incubated with the labeled polypeptides, washed to remove unbound polypeptides, and 
the polypeptide complex is quantified. Data obtained using different concentrations of 
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polypeptide are used to calculate values for the number, affinity, and association of 
polypeptide with the protein complex. 

Labeled polypeptides are also useful as reagents for the purification of 
molecules with which the polypeptide interacts including, but not limited to, inhibitors. 
5 In one embodiment of affinity purification, a polypeptide is covalently coupled to a 
chromatography column. Cells and their membranes are extracted, and various cellular 
subcomponents are passed over the column. Molecules bind to the column by virtue 
of their affinity to the polypeptide. The polypeptide-complex is recovered from the 
column, dissociated and the recovered molecule is subjected to protein sequencing. 
10 This amino acid sequence is then used to identify the captured molecule or to design 
degenerate oligonucleotides for cloning the corresponding gene from an appropriate 
cDNA library. 

Alternatively, compounds may be identified which exhibit similar properties to 
the ligand for the NgR of the invention, but which are smaller and exhibit a longer half 

15 time than the endogenous ligand in a human or animal body. When an organic 
compound is designed, a molecule according to the invention is used as a "lead" 
compound. The design of mimetics to known pharmaceutically active compounds is a 
well-known approach in the development of pharmaceuticals based on such "lead" 
compounds. Mimetic design, synthesis and testing are generally used to avoid 

20 randomly screening a large number of molecules for a target property. Furthermore, 
structural data deriving from the analysis of the deduced amino acid sequences 
encoded by the DNAs of the present invention are useful to design new drugs, more 
specific and therefore with a higher pharmacological potency. 

Comparison of the protein sequence of the present invention with the 

25 sequences present in all the available databases showed a significant homology with the 
transmembrane portion of G protein coupled receptors. Accordingly, computer 
modeling can be used to develop a putative tertiary structure of the proteins of the 
invention based on the available information of the transmembrane domain of other 
proteins. Thus, novel ligands based on the predicted structure of NgR can be 

30 designed. 

This invention further pertains to novel agents identified by the above-described 
screening assays and uses thereof for treatments as described herein. 



WO 02/29059 



PCT/US01/31488 



-88- 

Compositions and Pharmaceutical Compositions 

In a particular embodiment, the novel molecules identified by the screening 
methods according to the invention are low molecular weight organic molecules, in 
which case a composition or pharmaceutical composition can be prepared thereof for 
5 oral or parenteral administration. The compositions, or pharmaceutical compositions, 
comprising the nucleic acid molecules, vectors, polypeptides, antibodies and 
compounds identified by the screening methods described herein, typically comprise 
the nucleic acid molecule, protein, or antibody and a pharmaceutically acceptable 
carrier. As used herein, "pharmaceutically acceptable caITier ,, is intended to include 

10 any and all solvents, dispersion media, coatings, antibacterial and antifungal agents, 
isotonic and absorption delaying agents, and the like, compatible with pharmaceutical 
administration. The nature of the carrier or other ingredients will depend on the 
specific route of administration and particular embodiment of the invention to be 
administered. Examples of techniques and protocols that are useful in this context are, 

1 5 inter alia , found in Remington's PHARMACEUTICAL SCIENCES, 1 6th ed. , (1 980) Osol, 
A (Ed.), which is incorporated herein by reference in its entirety. Preferred examples 
of such carriers or diluents include, but are not limited to, water, saline, Ringer's 
solution, dextrose solution and 5% human serum albumin. Liposomes and 
non-aqueous vehicles such as fixed oils may also be used. The use of such media and 

20 agents for pharmaceutically active substances is well known in the art. Except insofar 
as any conventional media or agent is incompatible with the active compound, use 
thereof in the compositions is contemplated. Supplementary active compounds can 
also be incorporated into the compositions. 

A pharmaceutical composition of the invention is formulated to be compatible 

25 with its intended route of administration. Examples of routes of administration include 
oral and parenteral (e.g., intravenous, intradermal, subcutaneous, inhalation, 
transdermal (topical), transmucosal and rectal administration). Solutions or 
suspensions used for parenteral, intradermal or subcutaneous application can include 
the following components: a sterile diluent such as water for injection, saline solution, 

30 fixed oils, polyethylene glycols, glycerine, propylene glycol or other synthetic solvents; 
antibacterial agents such as benzyl alcohol or methyl parabens; antioxidants such as 
ascorbic acid or sodium bisulfite; chelating agents such as ethylenediaminetetraacetic 
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acid; buffers such as acetates, citrates or phosphates, and agents for the adjustment of 
tonicity such as sodium chloride or dextrose. The pH can be adjusted with acids or 
bases, such as hydrochloric acid or sodium hydroxide. The parenteral preparation can 
be enclosed in ampoules, disposable syringes or multiple dose vials made of glass or 
5 plastic. 

Pharmaceutical compositions suitable for injectable use include sterile aqueous 
solutions (where water soluble) or dispersions and sterile powders for the 
extemporaneous preparation of sterile injectable solutions or dispersion. For 
intravenous administration, suitable carriers include physiological saline, bacteriostatic 

10 water, Cremophor EL™ (BASF, Parsippany, NX) or phosphate buffered saline (PBS). 
In all cases, the composition must be sterile and should be fluid to the extent that easy 
syringeability exists. It must be stable under the conditions of manufacture and storage 
and must be preserved against the contaminating action of microorganisms such as 
bacteria and fungi. The carrier can be a solvent or dispersion medium containing, for 

15 example, water, ethanol, polyol (for example, glycerol, propylene glycol and liquid 
polyethylene glycol, and the like), and suitable mixtures thereof. The proper fluidity 
can be maintained, for example, by the use of a coating such as lecithin, by the 
maintenance of the required particle size in the case of dispersion and by the use of 
surfactants. Prevention of the action of microorganisms can be achieved by various 

20 antibacterial and antifungal agents, for example, parabens, chlorobutanol, phenol, 

ascorbic acid, thimerosal, and the like. In many cases, it will be preferable to include 
isotonic agents, for example, sugars, polyalcohols such as manitol, sorbitol, sodium 
chloride in the composition. Prolonged absorption of the injectable compositions can 
be brought about by including in the composition an agent which delays absorption, for 

25 example, aluminum monostearate and gelatin. 

Sterile injectable solutions can be prepared by incorporating the active 
compound (e.g., a NgR protein or anti-NgR antibody) in the required amount in an 
appropriate solvent with one or a combination of ingredients enumerated above, as 
required, followed by filtered sterilization. Generally, dispersions are prepared by 

30 incorporating the active compound into a sterile vehicle that contains a basic dispersion 
medium and the required other ingredients from those enumerated above. In the case 
of sterile powders for the preparation of sterile injectable solutions, methods of 
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preparation are vacuum drying and freeze-drying that yields a powder of the active 
ingredient plus any additional desired ingredient from a previously sterile-filtered 
solution thereof. 

Oral compositions generally include an inert diluent or an edible carrier. They 
5 can be enclosed in gelatin capsules or compressed into tablets. For the purpose of oral 
therapeutic administration, the active compound can be incorporated with excipients 
and used in the form of tablets, troches or capsules. Oral compositions can also be 
prepared using a fluid carrier for use as a mouthwash, wherein the compound in the 
fluid carrier is applied orally and swished and expectorated or swallowed. 

10 Pharmaceutical^ compatible binding agents, and/or adjuvant materials can be included 
as part of the composition. The tablets, pills, capsules, troches and the like can contain 
any of the following ingredients, or compounds of a similar nature: a binder such as 
microcrystalline cellulose, gum tragacanth or gelatin; an excipient such as starch or 
lactose, a disintegrating agent such as alginic acid, Primogel or corn starch; a lubricant 

15 such as magnesium stearate or Sterotes; a glidant such as colloidal silicon dioxide; a 
' sweetening agent such as sucrose or saccharin; or a flavoring agent such as 
peppermint, methyl salicylate or orange flavoring. 

For administration by inhalation, the compounds are delivered in the form of an 
aerosol spray from pressured container or dispenser which contains a suitable 

20 propellant, e.g., a gas such as carbon dioxide or a nebulizer. 

Systemic administration can also be by transmucosal or transdermal means. 
For transmucosal or transdermal administration, penetrants appropriate to the barrier 
to be permeated are used in the formulation. Such penetrants are generally known in 
the art, and include, for example, for transmucosal administration, detergents, bile 

25 salts, and fiisidic acid derivatives. Transmucosal administration can be accomplished 
through the use of nasal sprays or suppositories. For transdermal administration, the 
active compounds are formulated into ointments, salves, gels or creams as generally 
known in the art. 

The compounds can also be prepared in the form of suppositories {e.g., with 
30 conventional suppository bases such as cocoa butter and other glycerides) or retention 
enemas for rectal delivery. 
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In one embodiment, the active compounds are prepared with carriers that will 
protect the compound against rapid elimination from the body, such as a controlled 
release formulation, including implants and microencapsulated delivery systems. 
Biodegradable, biocompatible polymers can be used, such as ethylene vinyl acetate, 
5 polyanhydrides, polyglycolic acid, collagen, polyorthoesters, and polylactic acid. 
Methods for preparation of such formulations will be apparent to those skilled in the 
art. The materials can also be obtained commercially from Alza Corporation and Nova 
Pharmaceuticals, Inc. Liposomal suspensions (including liposomes targeted to infected 
cells with monoclonal antibodies to viral antigens) can also be used as pharmaceutical^ 

10 acceptable carriers. These can be prepared according to methods known to those 
skilled in the art, for example, as described in U.S. Patent No. 4,522,81 1. It is 
especially advantageous to formulate oral or parenteral compositions in dosage unit 
form for ease of administration and uniformity of dosage. Dosage unit form as used 
herein refers to physically discrete units suited as unitary dosages for the subject to be 

15 treated; each unit containing a predetermined quantity of active compound calculated 
to produce the desired therapeutic effect in association with the required 
pharmaceutical carrier. The specification for the dosage unit forms of the invention are 
dictated by and directly dependent on the unique characteristics of the active 
compound and the particular therapeutic effect to be achieved. 

20 The nucleic acid molecules of the invention can be inserted into vectors and 

used as gene therapy vectors. Gene therapy vectors can be delivered to a subject by 
any of a number of routes, e.g., as described in U.S. Patent No. 5,703,055. Delivery 
can thus also include, e.g., intravenous injection, local administration (see U.S. Patent 
No. 5,328,470) or stereotactic injection (see e.g., Chen et al (1994) Proc. Natl. Acad 

25 Sci. USA 91, 3054-3057). The pharmaceutical preparation of the gene therapy vector 
can include the gene therapy vector in an acceptable diluent, or can comprise a slow 
release matrix in which the gene delivery vehicle is imbedded. Alternatively, where the 
complete gene delivery vector can be produced intact from recombinant cells, e.g., 
retroviral vectors, the pharmaceutical preparation can include one or more cells that 

30 produce the gene delivery system. 

The pharmaceutical compositions can be included in a container, pack or 
dispenser together with instructions for administration. 
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The dosage of these low molecular weight compounds will depend on the 
disease state or condition to be treated and other clinical factors such as weight and 
condition of the human or animal and the route of administration of the compound. 
For treating human or animals, between approximately 0.5 mg/kg of body weight to 
5 500 mg/kg of body weight of the compound can be administered. Therapy is typically 
administered at lower dosages and is continued until the desired therapeutic outcome is 
observed. 

Another aspect of the present invention is the use of the NgR nucleotide 
sequences disclosed herein for identifying homologs of the Nogo-R, in other animals, 

10 including but not limited to humans and other mammals and invertebrates. Any of the 
nucleotide sequences disclosed herein, or any portion thereof, can be used, for 
example, as probes to screen databases or nucleic acid libraries, such as, for example, 
genomic or cDNA libraries, to identify homologs using screening procedures well 
known to those skilled in the art. Accordingly, homologs having at least 50%, more 

15 preferably at least 60%, more preferably at least 70%, more preferably at least 80%, 
more preferably at least 90%, more preferably at least 95%, and most preferably at 
least 100% homology with NgR sequences can be identified. 

The present compounds and methods, including nucleic acid molecules, 
polypeptides, antibodies, compounds identified by the screening methods described 

20 herein, have a variety of pharmaceutical applications and may be used, for example, to 
treat or prevent unregulated cellular growth, such as cancer cell and tumor growth. In 
a particular embodiment, the present molecules are used in gene therapy. For a review 
of gene therapy procedures, see e.g. Anderson Science (1992) 256, 808-813, which is 
incorporated herein by reference in its entirety. 

25 The present invention also encompasses a method of agonizing (stimulating) or 

antagonizing a NgR natural binding partner associated activity in a mammal comprising 
administering to said mammal an agonist or antagonist to one of the above disclosed 
polypeptides in an amount sufficient to effect said agonism or antagonism. One 
embodiment of the present invention, then, is a method of treating diseases in a 

30 mammal with an agonist or antagonist of the protein of the present invention 
comprising administering the agonist or antagonist to a mammal in an amount 
sufficient to agonize or antagonize NgR-associated functions. 
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Methods of determining the dosages of compounds to be administered to a 
patient and modes of administering compounds to an organism are disclosed in U.S. 
Application Serial No. 08/702,282, filed August 23, 1996, and International patent 
publication number WO 96/22976, published August 1, 1996, both of which are 
5 incorporated herein by reference in their entirety, including any drawings, figures or 
tables. Those skilled in the art will appreciate that such descriptions are applicable to 
the present invention and can be easily adapted to it. 

The proper dosage depends on various factors such as the type of disease being 
treated, the particular composition being used and the size and physiological condition 

10 of the patient. Therapeutically effective doses for the compounds described herein can 
be estimated initially from cell culture and animal models. For example, a dose can be 
formulated in animal models to achieve a circulating concentration range that initially 
takes into account the IC 50 as determined in cell culture assays. The animal model data 
can be used to more accurately determine useful doses in humans. 

15 Plasma half-life and biodistribution of the drug and metabolites in the plasma, 

tumors and major organs can also be determined to facilitate the selection of drugs 
most appropriate to inhibit a disorder. Such measurements can be carried out. For 
example, HPLC analysis can be performed on the plasma of animals treated with the 
drug and the location of radiolabeled compounds can be determined using detection 

20 methods such as X-ray, CAT scan and MRI. Compounds that show potent inhibitory 
activity in the screening assays, but have poor pharmacokinetic characteristics, can be 
optimized by altering the chemical structure and retesting. In this regard, compounds 
displaying good pharmacokinetic characteristics can be used as a model. 

Toxicity studies can also be carried out by measuring the blood cell 

25 composition. For example, toxicity studies can be carried out in a suitable animal 
model as follows: (1) the compound is administered to mice (an untreated control 
mouse should also be used); (2) blood samples are periodically obtained via the tail 
vein from one mouse in each treatment group; and (3) the samples are analyzed for red 
and white blood cell counts, blood cell composition and the percent of lymphocytes 

30 versus polymorphonuclear cells. A comparison of results for each dosing regime with 
the controls indicates if toxicity is present. 
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At the termination of each toxicity study, further studies can be carried out by 
sacrificing the animals (preferably, in accordance with the American Veterinary 
Medical Association guidelines Report of the American Veterinary Medical Assoc. 
Panel on Euthanasia, (1993) J. Am. Vet Med. Assoc. 202:229-249). Representative 
animals from each treatment group can then be examined by gross necropsy for 
immediate evidence of metastasis, unusual illness or toxicity. Gross abnormalities in 
tissue are noted and tissues are examined histologically. Compounds causing a 
reduction in body weight or blood components are less preferred, as are compounds 
having an adverse effect on major organs. In general, the greater the adverse effect the 
less preferred the compound. 

For the treatment of cancers the expected daily dose of a hydrophobic 
pharmaceutical agent is between 1 to 500 mg/day, preferably 1 to 250 mg/day, and 
most preferably 1 to 50 mg/day. Drugs can be delivered less frequently provided 
plasma levels of the active moiety are sufficient to maintain therapeutic effectiveness. 
Plasma levels should reflect the potency of the drug. Generally, the more potent the 
compound the lower the plasma levels necessary to achieve efficacy. 

NgR mRNA transcripts have been found in the brain and heart. SEQ ED NOs: 
1 and/or, 3 will, as detailed above, enable screening the endogenous 
neurotransmitters/hormones/Iigands which activate, agonize, or antagonize NgR and 
for compounds with potential utility in treating disorders including CNS disorders 
(e.g., stroke) and degenerative disorders such as those associated with demyelination. 

For example, NgR receptor activation may mediate the prevention of neurite 
outgrowth. Inhibition would be beneficial in both chronic and acute brain injury. See, 
e.g., Donovan et al, (1997) J. Neuwsci. 17, 5316-5326; Turgeon et al., (1998) J. 
Neurosci. 18, 6882-6891; Smith-Swintosky etal, (1997)7. Neurochem. 69, 
1890-1896; Grille* a/., (1998) Brain Res. 191, 321-327; Suidanetfa/., (1996) Semin. 
Thromb. Hemost. 22, 125-133. 
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Pharmacogenomics 

Agents, or modulators that have a stimulatory or inhibitory effect on NgR 
activity (e.g., NgR gene expression), as identified by a screening assay described herein 
can be administered to individuals to treat (prophylactically or therapeutically) 
5 disorders (e.g., a disease condition such as a demyelination disorder) associated with 
aberrant NgR activity. In conjunction with such treatment, the pharmacogenomics 
(i.e., the study of the relationship between an individual's genotype and that individual's 
response to a foreign compound or drug) of the individual may be considered. 
Differences in metabolism of therapeutics can lead to severe toxicity or therapeutic 

10 failure by altering the relation between dose and blood concentration of the 

pharmacologically active drug. Thus, the pharmacogenomics of the individual permits 
the selection of effective agents (e.g., drugs) for prophylactic or therapeutic treatments 
based on a consideration of the individual's genotype. Such pharmacogenomics can 
further be used to determine appropriate dosages and therapeutic regimens. 

15 Accordingly, the activity of NgR protein, expression of NgR nucleic acid or mutation 
content of NgR genes in an individual can be determined to thereby select appropriate 
agent(s) for therapeutic or prophylactic treatment of the individual. 

Pharmacogenomics deals with clinically significant hereditary variations in the 
response to drugs due to altered drug disposition and abnormal action in affected 

20 persons. See e.g., Eichelbaum (1996) Clin. Exp. Pharmacol. Physiol. 23, 983-985 and 
Under (1997) Clin. Chem. 43, 254-266. In general, two types of pharmacogenetic 
conditions can be differentiated. Genetic conditions transmitted as a single factor 
altering the way drugs act on the body (altered drug action) or genetic conditions 
transmitted as single factors altering the way the body acts on drugs (altered drug 

25 metabolism). These pharmacogenetic conditions can occur either as rare defects or as 
polymorphisms. For example, glucose-6-phosphate dehydrogenase (G6PD) deficiency 
is a common inherited enzymopathy in which the main clinical complication is 
haemolysis after ingestion of oxidant drugs (anti-malarials, sulfonamides, analgesics, 
nitrofurans) and consumption of fava beans. 

30 As an illustrative embodiment, the activity of drug metabolizing enzymes is a 

major determinant of both the intensity and duration of drug action. The discovery of 
genetic polymorphisms of drug metabolizing enzymes (e.g., N-acetyltransferase 2 
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(NAT 2) and cytochrome P450 enzymes CYP2D6 and CYP2C19) has provided an 
explanation as to why some patients do not obtain the expected drug effects or show 
exaggerated drug response and serious toxicity after taking the standard and safe dose 
of a drug. These polymorphisms are expressed in two phenotypes in the population, 
5 the extensive metabolizer (EM) and poor metabolizer (PM). The prevalence of PM is 
different among different populations. For example, the gene coding for CYP2D6 is 
highly polymorphic and several mutations have been identified in PM, which all lead to 
the absence of functional CYP2D6. Poor metabolizers of CYP2D6 and CYP2C19 
quite frequently experience exaggerated drug response and side effects when they 

10 receive standard doses. If a metabolite is the active therapeutic moiety, PM show no 
therapeutic response, as demonstrated for the analgesic effect of codeine mediated by 
its CYP2D6-formed metabolite morphine. At the other extreme are the so called ultra- 
rapid metabolizers who do not respond to standard doses. Recently, the molecular 
basis of ultra-rapid metabolism has been identified to be due to CYP2D6 gene 

15 amplification. 

Thus, the activity of NgR protein, expression of NgR nucleic acid, or mutation 
content of NgR genes in an individual can be determined to thereby select appropriate 
agent(s) for therapeutic or prophylactic treatment of the individual. In addition, 
pharmacogenetic studies can be used to apply genotyping of polymorphic alleles 

20 encoding drug-metabolizing enzymes to the identification of an individual's drug 

responsiveness phenotype. This knowledge, when applied to dosing or drug selection, 
can avoid adverse reactions or therapeutic failure and thus enhance therapeutic or 
prophylactic efficiency when treating a subject with a NgR modulator, such as a 
modulator identified by one of the exemplary screening assays described herein. 

25 

Monitoring Clinical Efficacy 

Monitoring the influence of agents (e.g., drugs, compounds) on the expression 
or activity of NgR (e.g., the ability to modulate aberrant cell proliferation and/or 
differentiation) can be applied not only in basic drug screening, but also in clinical 
30 trials. For example, the effectiveness of an agent determined by a screening assay as 
described herein to increase NgR gene expression, protein levels or upregulate NgR 
activity, can be monitored in clinical trials of subjects exhibiting decreased NgR gene 
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expression, protein levels, or downregulated NgR activity. Alternatively, the 
effectiveness of an agent determined by a screening assay to decrease NgR gene 
expression, protein levels, or downreguiate NgR activity, can be monitored in clinical 
trials of subjects exhibiting increased NgR gene expression, protein levels, or 
5 upregulated NgR activity. In such clinical trials, the expression or activity of NgR and, 
preferably, other genes that have been implicated in, for example, a disease or disorder, 
can be used as a "read out" or markers of the immune responsiveness of a particular 
cell. 

For example, genes, including NgR, that are modulated in cells by treatment 
10 with an agent (e.g., compound, drug or small molecule) that modulates NgR activity 
(e.g., identified in a screening assay as described herein) can be identified. Thus, to 
study the effect of agents on demyelination disorders, for example, in a clinical trial, 
cells can be isolated and KNA prepared and analyzed for the levels of expression of 
NgR and other genes implicated in the disorder. The levels of gene expression (Le. 9 a 
1 5 gene expression pattern) can be quantified by Northern blot analysis or RT-PCR, as 
described herein, or alternatively by measuring the amount of protein produced by one 
of the methods as described herein or by measuring the levels of activity of NgR or 
other genes. In this way, the gene expression pattern can serve as a marker, indicative 
of the physiological response of the cells to the agent. Accordingly, this response state 
20 may be determined before, and at various points during, treatment of the individual 
with the agent. 

In one embodiment, the invention provides a method for monitoring the 
effectiveness of treatment of a subject with an agent (e.g., an agonist, antagonist, 
protein, peptide, peptidomimetic, nucleic acid, small molecule, or other drug candidate 

25 identified by the screening assays described herein) comprising the steps of (/) 

obtaining a pre-administration sample from a subject prior to administration of the 
agent; (if) detecting the level of expression of a NgR protein, mRNA, or genomic 
DNA in the preadministration sample; (///) obtaining one or more post-administration 
samples from the subject; (jv) detecting the level of expression or activity of the NgR 

30 protein, mRNA, or genomic DNA in the post-administration samples; (v) comparing 
the level of expression or activity of the NgR protein, mRNA or genomic DNA in the 
pre-administration sample with the NgR protein, mRNA or genomic DNA in the post 
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administration sample or samples; and (vi) altering the administration of the agent to 
the subject accordingly. For example, increased administration of the agent may be 
desirable to increase the expression or activity of NgR to higher levels than detected, 
i.e., to increase the effectiveness of the agent. Alternatively, decreased administration 
5 of the agent may be desirable to decrease expression or activity of NgR to lower levels 
than detected, i.e., to decrease the effectiveness of the agent. 



Methods of Treatment 

The present invention provides for both prophylactic and therapeutic methods 

10 of treating a subject at risk of (or susceptible to) a disorder or having a disorder 
associated with aberrant NgR expression or activity. 

Diseases and disorders that are characterized by increased (relative to a subject 
not suffering from the disease or disorder) levels or biological activity may be treated 
with Therapeutics that antagonize (i.e., reduce or inhibit) activity. Therapeutics that 

15 antagonize activity may be administered in a therapeutic or prophylactic manner. 

Therapeutics that may be utilized include, but are not limited to, (/) a NgR polypeptide, 
or analogs, derivatives, fragments or homologs thereof; (//) antibodies to a NgR 
peptide; (Hi) nucleic acids encoding a NgR peptide; (;v) administration of antisense 
nucleic acid and nucleic acids that are "dysfunctional" (i.e., due to a heterologous 

20 insertion within the coding sequences of coding sequences to a NgR peptide) are 
utilized to "knockout" endogenous function of a NgR peptide by homologous 
recombination (see, e.g., Capecchi (1989) Science 244, 1288-1292); or (v) modulators 
(i.e., inhibitors, agonists and antagonists, including additional peptide mimetic of the 
invention or antibodies specific to a peptide of the invention) that alter the interaction 

25 between a NgR peptide and its binding partner. 

Diseases and disorders that are characterized by decreased (relative to a subject 
not suffering from the disease or disorder) levels or biological activity may be treated 
with Therapeutics that increase (i.e., are agonists to) activity. Therapeutics that 
upregulate activity may be administered in a therapeutic or prophylactic manner. 

30 Therapeutics that may be utilized include, but are not limited to, a NgR peptide, or 
analogs, derivatives, fragments or homologs thereof; or an agonist that increases 
bioavailability. 
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Increased or decreased levels can be readily detected by quantifying peptide 
and/or RNA, by obtaining a patient tissue sample (e.g., from biopsy tissue) and 
assaying it in vitro for RNA or peptide levels, structure and/or activity of the expressed 
peptides (or mRNAs of a NgR peptide). Methods that are well-known within the art 
5 include, but are not limited to, immunoassays (e.g., by Western blot analysis, 

immunoprecipitation followed by sodium dodecyl sulfate (SDS) polyacrylamide gel 
electrophoresis, immunocytochemistry, etc.) and/or hybridization assays to detect 
expression of mRNAs (e.g., Northern assays, dot blots, in situ hybridization, etc.). 
In one aspect, the invention provides a method for preventing, in a subject, a 

10 disease or condition associated with an aberrant NgR expression or activity, by 

administering to the subject an agent that modulates NgR expression or at least one 
NgR activity. Subjects at risk for a disease that is caused or contributed to by aberrant 
NgR expression or activity can be identified by, for example, any or a combination of 
diagnostic or prognostic assays as described herein. Administration of a prophylactic 

15 agent can occur prior to the manifestation of symptoms characteristic of the NgR 
aberrancy, such that a disease or disorder is prevented or, alternatively, delayed in its 
progression. Depending on the type of NgR aberrancy, for example, a NgR agonist or 
NgR antagonist agent can be used for treating the subject. The appropriate agent can 
be determined based on screening assays described herein. 

20 Another aspect of the invention pertains to methods of modulating NgR 

expression or activity for therapeutic purposes. The modulatory method of the 
invention involves contacting a cell with an agent that modulates one or more of the 
activities of NgR protein activity associated with the cell. An agent that modulates 
NgR protein activity can be an agent as described herein, such as a nucleic acid or a 

25 protein, a naturally-occurring cognate ligand of a NgR protein, a peptide, a NgR 

peptidomimetic, or other small molecule. In one embodiment, the agent stimulates one 
or more NgR protein activity. Examples of such stimulatory agents include active 
NgR protein and a nucleic acid molecule encoding NgR that has been introduced into 
the cell. In another embodiment, the agent inhibits one or more NgR protein activity. 

30 Examples of such inhibitory agents include antisense NgR nucleic acid molecules and 
anti-NgR antibodies. These modulatory methods can be performed in vitro (e.g., by 
culturing the cell with the agent) or, alternatively, in vivo (e.g., by administering the 
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agent to a subject). As such, the present invention provides methods of treating an 
individual afflicted with a disease or disorder characterized by aberrant expression or 
activity of a NgR protein or nucleic acid molecule. In one embodiment, the method 
involves administering an agent (e.g., an agent identified by a screening assay described 
5 herein), or combination of agents that modulates (e.g., upregulates or downregulates) 
NgR expression or activity. In another embodiment, the method involves 
administering a NgR protein or nucleic acid molecule as therapy to compensate for 
reduced or aberrant NgR expression or activity. 

10 Gene Therapy 

Mutations in the NgR gene that result in loss of normal function of the NgR 
gene product underlie NgR human disease states. The invention comprehends gene 
therapy to restore NgR activity to treat those disease states. Delivery of a functional 
NgR gene to appropriate cells is effected ex vivo, in situ, or in vivo by use of vectors, 

15 and more particularly viral vectors (e.g., adenovirus, adeno-associated virus, or a 
retrovirus), or ex vivo by use of physical DNA transfer methods (e.g., liposomes or 
chemical treatments). See, for example, Anderson (1998) Nature, supplement to 
392(6679):25-20. For additional reviews of gene therapy technology see Friedmann 
(1989) Science 244, 1275-1281; Verma (1990) Sci. Am. 68-84; and Miller (1992) 

20 Nature 357, 455-460. Alternatively, it is contemplated that in other human disease 
states, preventing the expression of, or inhibiting the activity of, NgR will be useful in 
.'treating disease states. It is contemplated that antisense therapy or gene therapy could 
be applied to negatively regulate the expression of NgR. 

The present invention provides for both prophylactic and therapeutic methods 

25 of treating a subject at risk of (or susceptible to) a disorder or having a disorder 
associated with aberrant NgR expression or activity. 

Diseases and disorders that are characterized by increased (relative to a subject 
not suffering from the disease or disorder) levels or biological activity may be treated 
with Therapeutics that antagonize (i.e., reduce or inhibit) activity. Therapeutics that 

30 antagonize activity may be administered in a therapeutic or prophylactic manner. 

Therapeutics that may be utilized include, but are not limited to, (i) a NgR polypeptide, 
or analogs, derivatives, fragments or homologs thereof; (») antibodies to a NgR 
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peptide; (in) nucleic acids encoding a NgR peptide; (iv) administration of antisense 
nucleic acid and nucleic acids that are "dysfunctional" (i.e., due to a heterologous 
insertion within the coding sequences of coding sequences to a NgR peptide) are 
utilized to "knockout" endogenous function of a NgR peptide by homologous 
5 recombination (see, e.g., Capecchi (1989) , above); or (v) modulators (i.e., inhibitors, 
agonists and antagonists, including additional peptide mimetic of the invention or 
antibodies specific to a peptide of the invention) that alter the interaction between a 
NgR peptide and its binding partner. 

Diseases and disorders that are characterized by decreased (relative to a subject 

10 not suffering from the disease or disorder) levels or biological activity may be treated 
with Therapeutics that increase (i.e., are agonists to) activity. Therapeutics that 
upregulate activity may be administered in a therapeutic or prophylactic manner. 
Therapeutics that may be utilized include, but are not limited to, a NgR peptide, or 
analogs, derivatives, fragments or homologs thereof; or an agonist that increases 

15 bioavailability. 

Increased or decreased levels can be readily detected by quantifying peptide 
and/or RNA, by obtaining a patient tissue sample (e.g., from biopsy tissue) and 
assaying it in vitro for RNA or peptide levels, structure and/or activity of the expressed 
peptides (or mRNAs of a NgR peptide). Methods that are well-known within the art 

20 include, but are not limited to, immunoassays (e.g., by Western blot analysis, 

immunoprecipitation followed by sodium dodecyl sulfate (SDS) polyacrylamide gel 
electrophoresis, immunocytochemistry, etc.) and/or hybridization assays to detect 
expression of mRNAs (e.g., Northern assays, dot blots, in situ hybridization, etc.). 
In one aspect, the invention provides a method for preventing, in a subject, a 

25 disease or condition associated with an aberrant NgR expression or activity, by 

administering to the subject an agent that modulates NgR expression or at least one 
NgR activity. Subjects at risk for a disease that is caused or contributed to by aberrant 
NgR expression or activity can be identified by, for example, any or a combination of 
diagnostic or prognostic assays as described herein. Administration of a prophylactic 

30 agent can occur prior to the manifestation of symptoms characteristic of the NgR 

aberrancy, such that a disease or disorder is prevented or, alternatively, delayed in its 
progression. Depending on the type of NgR aberrancy, for example, a NgR agonist or 
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NgR antagonist agent can be used for treating the subject. The appropriate agent can 
be determined based on screening assays described herein. 

Another aspect of the invention pertains to methods of modulating NgR 
expression or activity for therapeutic purposes. The modulatory method of the 
5 invention involves contacting a cell with an agent that modulates one or more of the 
activities of NgR protein activity associated with the cell. An agent that modulates 
NgR protein activity can be an agent as described herein, such as a nucleic acid or a 
protein, a naturally-occurring cognate ligand of a NgR protein, a peptide, a NgR 
peptidomimetic, or other small molecule. In one embodiment, the agent stimulates one 

10 or more NgR protein activity. Examples of such stimulatory agents include active 

NgR protein and a nucleic acid molecule encoding NgR that has been introduced into 
the cell. In another embodiment, the agent inhibits one or more NgR protein activity. 
Examples of such inhibitory agents include antisense NgR nucleic acid molecules and 
anti-NgR antibodies. These modulatory methods can be performed in vitro, (e.g., by 

1 5 culturing the cell with the agent) or, alternatively, in vivo (e.g. , by administering the 
agent to a subject). As such, the present invention provides methods of treating an 
individual afflicted with a disease or disorder characterized by aberrant expression or 
activity of a NgR protein or nucleic acid molecule. In one embodiment, the method 
involves administering an agent (e.g., an agent identified by a screening assay described 

20 herein), or combination of agents that modulates (e.g., upregulates or downregulates) 
NgR expression or activity. In another embodiment, the method involves 
administering a NgR protein or nucleic acid molecule as therapy to compensate for 
reduced or aberrant NgR expression or activity. 

The present invention is not to be limited in scope by the specific embodiments 

25 described herein. Indeed, various modifications of the invention in addition to those 
described herein will become apparent to those skilled in the art from the foregoing 
description and accompanying figure. Such modifications are intended to fall within 
the scope of the appended claims. 

The following Table 5 contains the sequences of exemplary polynucleotides 

30 and polypeptides of the invention. 
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TABLE5 



The following DNA sequence NgR2 <SEQ ID NO. 1> was identified in humans: 



ATGCTGCCCGGGCTCAGGCGCCTGCTGCAAGCTCCCGCCTCGGCCTGCCTCCTGCTGATG 
CTCCTGGCCCTGCCCCTGGCGGCCCCCAGCTGCCCCATGCTCTGCACCTGCTACTCATCC 
CCGCCCACCGTGAGCTGCCAGGCCAACAACTTCTCCTCTGTGCCGCTGTCCCTGCCACCC 
AGCACTCAGCGACTCTTCCTGCAGAACAACCTCATCCGCACGCTGCGGCCAGGCACCTTT 
GGGTCCAACCTGCTCACCCTGTGGCTCTTCTCCAACAACCTCTCCACCATCTACCCGGGC 
ACTTTCCGCCACTTGCAAGCCCTGGAGGAGCTGGACCTCGGTGACAACCGGCACCTGCGC 
TCGCTGGAGCCCGACACCTTCCAGGGCCTGGAGCGGCTGCAGTCGCTGCATTTGTACCGC 
TGCCAGCTCAGCAGCCTGCCCGGCAACATCTTCCGAGGCCTGGTCAGCCTGCAGTACCTC 
TACCTCCAGGAGAACAGCCTGCTCCACCTACAGGATGACTTGTTCGCGGACCTGGCCAAC 
CTGAGCCACCTCTTCCTCCACGGGAACCGCCTGCGGCTGCTCACAGAGCACGTGTTTCGC 
GGCCTGGGCAGCCTGGACC GGCTGCTGCTGCAC GGGAACCGGCTGCAGGGCGTGCACCGC 
GCGGCCTTCCGCGGCCTCAGCCGCCTCACCATCCTCTACCTGTTCAACAACAGCCTGGCC 
TCGCTGCCCGGCGAGGCGCTCGCCGACCTGCCCTCGCTCGAGTTCCTGCGGCTCAACGCT 
AACCCCTGGGCGTGCGACTGCCGCGCGCGGCCGCTCTGGGCCTGGTTCCAGCGCGCGCGC 
GTGTCCAGCTCCGACGTGACCTGCGCCACCCCCCCGGAGCGCCAGGGCCGAGACCTGCGC 
GCGCTCCGCGAGGCCGACTTCCAGGCGTGTCCGCCCGCGGCACCCACGCGGCCGGGCAGC 
CGCGCCCGCGGCAACAGCTCCTCCAACCACCTGTACGGGGTGGCCGAGGCCGGGGCGCCC 
CCAGCCGATCCCTCCACCCTCTACCGAGATCTGCCTGCCGAAGACTCGCGGGGGCGCCAG 
GGCGGGGACGCGCCTACTGAGGACGACTACTGGGGGGGCTACGGGGGTGAGGACCAGCGA 
GGGGAGCAGATGTGCCCCGGCGCTGCCTGCCAGGCGCCCCCGGACTCCCGAGGCCCTGCG 
CTCTCGGCCGGGCTCCCCAGCCCTCTGCTTTGCCTCCTGCTCCTGGTGCCCCACCACCTC 



The following amino acid sequence <SEQ ID NO. 2> is the predicted amino acid 

sequence derived from the DNA sequence of SEQ ID NO. 1 : 

MLPGLRRLLQAPASACLLLMLLALPLAAPSC 
PMLCTCYSSPPTVSCQANNFSSVPLSLPPST 
QRLFLQNNLIRTLRPGTFGSNLLTLWLFSNN 
LSTIYPGTFRHLQALEELDLGDNRHLRSLEP 
DTFQGLERLQSLHLYRCQLSSLPGNIFRGLV 
SLQYLYLQENSLLHLQDDLFADLANLSHLFL 
HGNRLRLLTEHVFRGLGSLDRLLLHGNRLQG 
VHRAAFRGLSRLTILYLFNNSLASLPGEALA 
DLPSLEFLRLNANPWACDCRARPLWAWFQRA 
RVSSSDVTCATPPERQGRDLRALREADFQAC 
PPAAPTRPGSRARGNSSSNHLYGVAEAGAPP 
ADPSTLYRDLPAED SRGRQ GGDAPTEDDYWG 
GYGGEDQRGEQMCPGAACQAPPDSRGPALSA 
GLPSPLLCLLLL VPHHL 



WO 02/29059 



PCT/US01/31488 



10 



15 



20 



25 



-104- 

The following DNA sequence NgR3 <SEQ ID NO. 3> was identified in mouse: 

ATGTCTTGGCAGTCTGGAACCACAGTGACACAATCTCCCGTGCAGGCTGCTCAGGTCTCA 
GGGTGCTGTGTGGAATTGCTGCTGTTGCTGCTCGCTGGAGAGCTACCTCTGGGTGGTGGT 
TGTCCTCGAGACTGTGTGTGCTACCCTGCGCCCATGACTGTCAGCTGCCAGGCACACAAC 
TTTGCTGCCATCCCGGAGGGCATCCCAGAGGACAGTGAGCGCATCTTCCTGCAGAACAAT 
CGCATCACCTTCCTCCAGCAGGGCCACTTCAGCCCCGCCATGGTCACCCTCTGGATCTAC 
TCCAACAACATCACTTTCATTGCTCCCAACACCTTCGAGGGCTTTGTGCATCTGGAGGAG 
CTAGACCTTGGAGACAACCGACAGCTGCGAACGCTGGCACCCGAGACCTTCCAAGGCCTG 
GTGAAGCTTCACGCCCTCTACCTCTATAAGTGTGGACTGAGCGCCCTGCCCGCAGGCATC 
TTTGGTGGCCTGCACAGCCTGCAGTATCTCTACTTGCAGGACAACCATATCGAGTACCTC 
CAAGATGACATCTTTGTGGACCTGGTCAATCTCAGTCACTTGTTTCTCCATGGTAACAAG 
CTATGGAGCCTGGGCCAAGGCATCTTCCGGGGCCTGGTGAACCTGGACCGGTTGCTGCTG 
CATGAGAACCAGCTACAGTGGGTTCACCACAAGGCTTTCCATGACCTCCACAGGCTAACC 
ACCCTCTTTCTCTTCAACAACAGCCTCACTGAGCTGCAGGGTGACTGTCTGGCCCCCCTG 
GTGGCCTTGGAGTTCCTTCGCCTCAATGGGAATGCTTGGGACTGTGGCTGCCGGGCACGT 
TCCCTGTGGGAATGGCTGCGAAGGTTCCGTGGCTCTAGCTCTGCTGTCCCCTGCGCGACC 
CCCGAGCTGCGGCAAGGCCAGGATCTGAAGCTGCTGAGGGTGGAGGACTTCCGGAACTGC 
ACAGGACCAGTGTCTCCTCACCAGATCAAGTCTCACACGCTTACCACCTCTGACAGGGCT 
GCCCGCAAGGAGCACCATCCGTCCCATGGGGCCTCCAGGGACAAAGGCCACCCACATGGC 
CATCCGCCTGGCTCCAGGTCAGGTTACAAGAAGGCAGGCAAGAACTGCACCAGCCACAGG 
AACCGGAACCAGATCTCTAAGGTGAGCTCTGGGAAAGAGCTTACCGAACTGCAGGACTAT 
GCCCCCGACTATCAGCACAAGTTCAGCTTTGACATCATGCCCACCGCACGACCCAAGAGG 
AAGGGCAAGTGTGCTCGCAGGACCCCCATCCGTGCCCCCAGTGGGGTGCAGCAGGCATCC 
TCAGGCACGGCCCTTGGGGCCCCACTCCTGGCCTGGATACTGGGGCTGGCAGTCACTCTC 
CGC 
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The following protein sequence <SEQ ED NO. 4> is deduced protein of SEQ ID 
NO:3: 

MSWQSGTTVTQSPVQAAQVSGCCVELLLLLL 
AGELPLGGGCPRDCVCYPAPMTVSCQAHNFA 
AIPEGIPEDSERIFLQNNRITFLQQGHFSPA 
M VTL WI YS NNITFI APNTFE GF VHLEELD L G 
DNRQLRTLAPETFQGLVKLHALYLYKCGLSA 
LPAGIFGGLHSLQYLYLQDNHIEYLQDDIFV 
DLVNLSHLFLHGNKLWSLGQGIFRGLVNLDR 
LLLHENQL Q W VHHK AFHDLHRLTTLFLFNN S 
LTELQGDCL APL V ALEFLRLN GN A WD C G CR A 
RSLWEWLRRFRGSSS A VP C A TP E L R Q G Q D L K 
LLRVEDFRNCTGP VSPHQIKSHTLTT SD R A A 
RKEHHPSHGASRDKGHPHGHPPGSRSGYKKA 
GKNCTSHRNRNQISKVSSGKELTELQD YAPD 
YQHKFSFDIMPT ARPKRKGKC ARRTPIRAPS 
GVQ QASSGTALGAPLLAWILGLAVTLR 
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The following protein sequence <SEQ ID NO. 5> is NgRl from humans: 

MKRASAGGSRLLAWVLWLQAWQVAAPCPGA 
C 

CYNEPKVTTSCPQQGLQAVPVGIPAASQRI 
FLHGNRISHVPAASFRACRNLTIL WLHSNVL 
ARIDAAAFTGLALLEQLDLSDNAQLRSVDPA 
TFHGLGRLHTLHLDRCGLQELGPGLFRGLAA 
LQYLYLQDNALQALPDDTFRDLGNLTHLFLH 
GNRISSVPERAFRGLHSLDRLLLHQNRVAHV 
HPHAFRDLGRLMTLYLFANNLSALPTEALAP 
LRALQYLRLNDNPWVCDCRARPLWAWLQKFR 
GSSSEVPCSLPQRLAGRDLKRLAANDLQGCA 
VATGPYHPIWTGRATDEEPLGLPKCCQPDAA 
DKASVLEPGRPASAGNALKGRVPPGDSPPGN 
GSGPRHINDSPFGTLPGSAEPPLTAVRPEGS 
EPPGFPTSGPRRRPGCSRKNRTRSHCRLGQ A 
GSGGGGTGDSEGSG ALP SLT CSLTPLGLALV 
LWTVLGPC 

The following amino acid sequence <SEQ ID NO:6> is a Consensus Sequence of 
NgR based on homology with NgRl 

CPXXCXCYXXPXXTXSCXXXXXXXXPX 
XXPXXXXRXFLXXNXIXXXXXXXFXXXXXXXXLWX 
XSNXXXXIXXXXFXXXXXLEXLDLXDNXXLR 
XXXPXTFXGLXXLXLXLXXCXLXXLXXXXFX 
GLXXLQYLYLQXNXXXXLXDDXFXDLXNLXH 
LFLHGNXXXXXXXXXFRGLXXLDRLLLHXNX 
XXXVHXXAFXXLXRLXXLXLFXNXLXXLXXX 
XLAXLXXLXXLRLNXNXWXCXCRARXLWXWX 
XXXRXSSSXVXCXXPXXXXGXDLXXLXXXDX 
XXCXXXXXPXXPXXXXXXXXXXXXXXXXXXX 
XXXXXXXXXXXXXXXXXXGXXXXXXXXXXXX 
PPXXXSXXXXXXXXXXXXXXXXXXXXXXXXX 
XXXXXXXXXXXXXXXXXXXXXXXXXXXRXXX 
XXXXXXXXXXXXXXXXXXXXXXXXLXXXXX 
XX X XX L 



The following protein sequence <SEQ ID NO:7> is the 66 amino acid active domain 
ofNogo: 

RIYKGVIQAIQKSDEGHPFRAYLESEVAISE 
ELVQKYSNSALGHVNCTIKELRRLFLVDDLV 
DSLK 
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The following protein sequence <SEQ ID NO:8> is the amino acid sequence of the 
mature NgR2: 

CPMLCTCYSSPPTVSCQ.ANNFSSVPLSLPPS 
TQRLFLQNNLIRTLRPGTF GS NLLTLWLFSN 
NLSTIYPGTFRHLQALEELDLGDNRHLRSLE 
PDTFQGLERLQSLHLYRCQLSSLPGNIFRGL 
VSLQYLYLQENSLLHLQDDLFADLANLSHLF 
LHGNRLRLLTEHVFRGLGSLDRLLLHGNRLQ 
GVHRAAFRGLSRLTILYLFNNSLASLPGEAL 
ADLPSLEFLRLNANPWACDCRARPLWAWFQR 
ARVSSSDVTCATPPERQGRDLRALREADFQA 
CPPAAPTRPGSRARGNSSSNHLYGVAEAGAP 
PADPSTLYRDLPAEDSRGRQGGDAPTEDDYW 
GGYGGEDQRGEQMCPGAACQAPPD SRGPALS 
AGLPSPLLCLLLL VP HHL 



The following protein sequence <SEQ ID NO:9> is the amino acid sequence of the 
mature NgR3: 

CPRDCVCYPAPMTVSCQAHNFAAIPEGIPED 
SERIFLQNNRITFLQQGHFSPAMVTLWIYSN 
NITFIAPNTFEGFVHLEELDLGDNRQLRTLA 
PETFQGLVKLHALYLYKCGLSALPAGIFGGL 
HSLQYLYLQDNHIEYLQDDIFVDLVNLSHLF 
LHGNKLWSLGQGIFRGLVNLDRLLLHENQLQ 
WVHHKAFHDLHRLTTLFLFNNSLTELQGDCL 
APLVALEFLRLNGNAWDCGCRARSLWEWLRR 
FRGSSSAVPCATPELRQGQDLKLLRVEDFRN 
CTGPVSPHQIKSHTLTTSDRAARKEHHPSHG 
ASRDKGHPHGHPPGSRSGYKKAGKNCTSHRN 
RNQISKVSSGKELTELQDYAPDYQHKFSFDI 
MPTARPKRKGKCARRTPIRAPSGVQQASSGT 
ALGAPLLAWILGLAVTLR 



The following amino acid sequence <SEQ ID NO:10> is a conserved cysteine motif 
(Cysteine domain 1) of the NgR and homologs based on the Consensus Sequence: 
CPXXCXCYXXPXXTXSC 



WO 02/29059 



PCT/US01/31488 



-107- 



The following amino acid sequence <SEQ ID NO: 11> is a conserved cysteine motif 

(Cysteine domain 2) of the NgR and homologs based on the Consensus Sequence: 

NXWXCXCRARXLWXWXXXXRXSSSXVXCXXP 
XXX XGXDLXXLXXXDXXXC 

5 



The following amino acid sequence <SEQ ID NO: 12> is a conserved Leucine-rich 

domain of the NgR and homologs based on the Consensus Sequence: 

RXFLXXNXIXXXXXXXFXXXXXXXXLWXXSN 
10 XXXXIXXXXFXXXXXLEXLDLXDNXXLRXXX 
PXTFXGLXXLXLXLXXCXLXXLXXXXFXGLX 
XLQYLYLQXNXXXXLXDDXFXDLXNLXHLFL 
HGNXXXXXXXXXFRGLXXLDRLLLHXNXXXX 
VHXXAFXXLXRLXXLXLFXNXLXXLXXXXLA 
15 XLXXLXXLRL 



Unless otherwise indicated, X is any amino acid. For example, X where 
indicated may be no amino acid. Additional features of the invention will be apparent 
from the following Examples. Examples 1-5 are actual, while the remaining Examples 

20 are prophetic. 

As shown by the following Examples, a gene encoding novel NgRs have been 
identified by computational analysis of DNA sequence data. The proteins encoded by 
NgR2 and NgR3 have a putative signal sequence, eight leucine-rich repeat domains in 
a conserved leucine-rich region (SEQ ID NO: 12), a conserved cysteine-rich region 

25 (SEQ ID NO: 1 0) N-terminal to the leucine-rich region, a second cysteine-rich domain 
(SEQ ID NO: 1 1) C-terminal to the leucine-rich region, and a putative 
glycophosphatidylinositol-linkage (GPI-linkage) site. NgR2 and NgR3 differ from the 
previously identified NgR sequence. The NgR homologs, when compared to known 
NgRs, show a consensus sequence (SEQ ID NOs:6). The putative mature NgR2 and 

30 NgR3 are shown in Table 5 as SEQ ID NOs: 8 and 9, respectively. 



Example 1: Tblastn query of the HTG database 

The protein sequence for the human NgR (NgRl) (SEQ ID NO:5) was used to 
query the high throughput genomic (HTG) database the use of which is familiar to 
35 those skilled in the art. The HTG database is a part of GenBank, a comprehensive 
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N1H genetic sequence database, which includes an annotated collection of all publicly 
available DNA sequences (Nucleic Acids Res. (2000) 28, 15-8). The HTG database 
includes sequences obtained from genomic DNA. Within genomic DNA, genes are 
typically encoded by multiple segments of DNA called exons. Thus when one aligns a 
5 cDNA sequence (or a protein sequence encoded by a cDNA sequence) to a genomic 
sequence, the sequence will be broken up into segments depending on the number of 
exons in the gene. 

The BLAST algorithm, which stands for Basic Local Alignment Search Tool is 
suitable for determining sequence similarity (Altschul et al, (1990) X Mol Biol 215, 

10 403-410, which is incorporated herein by reference in its entirety). Software for 
performing BLAST analyses is publicly available through the National Center for 
Biotechnology Information (http://www.ncbi.nlm.nih.gov/). The basic BLAST 
algorithm involves first identifying high scoring sequence pair (HSPs) by identifying 
short words of length W in the query sequence that either match or satisfy some 

15 positive-valued threshold score T when aligned with a word of the same length in a 
database sequence. T is referred to as the neighborhood word score threshold 
(Altschul et al, supra). These initial neighborhood word hits act as seeds for initiating 
searches to find HSPs containing them. The word hits are extended in both directions 
along each sequence, for as far as the cumulative alignment score can be increased. 

20 Extension for the word hits in each direction are halted when: 1) the cumulative 

alignment score falls off by the quantity X from its maximum achieved value; 2) the 
cumulative score goes to zero or below, due to the accumulation of one or more 
negative-scoring residue alignments; or 3) the end of either sequence is reached. The 
Blast algorithm parameters W, T and X determine the sensitivity and speed of the 

25 alignment. The Blast program uses as defaults a word length (W) of 1 1, the 

BLOSUM62 scoring matrix (see Henikoff a/., (1992) Proc. Nail Acad Sci. USA 
89, 10915-10919, which is incorporated herein by reference in its entirety) alignments 
(B) of 50, expectation (E) of 10, M=5, N=4, and a comparison of both strands. 

The BLAST algorithm (Karlin etal, (1993) Proc Natl Acad Sci. USA 90, 

30 5873-5787, which is incorporated herein by reference) and Gapped BLAST perform a 
statistical analysis of the similarity between two sequences. One measure of similarity 
provided by the BLAST algorithm is the smallest sum probability (P(N))> which 
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provides an indication of the probability by which a match between two nucleotide or 
amino acid sequences would occur by chance. For example, a nucleic acid is 
considered similar to a NgR gene or cDNA if the smallest sum probability in 
comparison of the test nucleic acid to a NgR nucleic acid is less than about 1, 
5 preferably less than about 0. 1 3 more preferably less than about 0.01 , and most 
preferably less than about 0.001. 

To query the HTG database with the NgR protein sequence, we used a 
variation of the BLAST algorithm known as the tblastn program, which compares a 
protein query sequence against a nucleotide sequence database dynamically translated 

10 in all reading frames (J. Mol Biol (1990) 215, 403-410: Nucleic Acids Res. (1997) 
25, 3389-3402). The results of the tblastn search indicated the presence of genes in 
the database with a significant identity to the NgR. In addition to finding hits to 
genomic clones which contain the human and mouse NgR genes, we found hits to 
clones where the identity was not as high, but still very significant. Three human 

15 clones were found (Accession numbers: AC068514, AC016869, AC013606) with an 
e-value of 4e-43 and one mouse clone was found (Accession No. AC021768) with an 
e-value of le-78. The three human clones all appeared to encode the same gene, so 
further analysis was confined to AGO 13 606. 

20 Example 2: Prediction of the human NgR2 protein sequence (AC013606) 

The human NgR protein sequence aligned with two regions of translated 
sequence from nucleotide sequence AC013606, indicating that the new gene was 
encoded by at least two exons. In order to define the complete gene, we used the 
computer program GENSCAN™ {1 Mol Biol (1997) 268, 78-94) which can identify 

25 complete exon/intron structures of genes in genomic DNA. The gene prediction by 
GENESCAN™ contained seven exons. By comparing these predicted exons to the 
NgR, it was concluded that the new human gene contains two of these exons and a 
part of another (containing the initiating methionine). The predicted cDNA (mRNA) 
encoded by these three exons was assembled from AC0 13606 (HTG1 1; deposited 

30 March 2000; length = 143899; GenBank release 1 18.0; SEQ ID NO: 15) by combining 
nucleotides from the three exons whose coordinates are: 123292-123322 (exon 1); 
130035-130516 (exon 2); and 138589-139335 (exon 3). The sequence for this cDNA 
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sequence is SEQ ID NO:l (nucleotide sequence of human NgR2; AC013606). The 
translation of this cDNA provides the protein sequence of human NgR2 (SEQ ID 
NO:2). 

We used the protein sequence of human NgR2 as a query sequence against the 
5 human EST database. A number of hits of high significance were found indicating that 
the NgR2 mRNA is expressed in a number of tissues including fetal brain. 
Furthermore, two of these ESTs provided support for the exon structure that we 
deduced. One EST (Accession No: GBJEST19:AI346757) contains 565 nucleotides 
corresponding to amino acids 84-271 of the human NgR2 (SEQ ID No:4). This spans 

10 the second intron located between amino acids 171 and 172, and provides positive 
evidence for the splicing of exons 2 and 3 at the mRNA level. Another EST 
(GBJEST26:AI929019) contains 545 nucleotides, part of which corresponds to amino 
acids 1-75 of the human NgR2 (SEQ ID NO:2). This spans the first intron located 
between amino acids 10 and 1 1, and provides positive evidence for the splicing of 

15 exons 1 and 2 at the mRNA level. 

Example 3: Prediction of the mouse NgR3 protein sequence (AC021768) 

The human NgR protein sequence aligned with only one region of translated 
sequence from nucleotide sequence AC021768, indicating that most of the new mouse 

20 gene was encoded by one large exon. However, upon inspection, the protein encoded 
by this exon was missing an initiating methionine. In order to define the complete 
gene, we used the computer program GENSCAN as described above. The gene 
prediction by GENSCAN contained two exons; the large one found by visual 
inspection and a short one at the 5' end which provided an initiating methionine. The 

25 predicted cDNA (mRNA) encoded by these two exons was assembled from AC021768 
(HTG14; deposited March 2000; length = 215980; GenBank release 118.0; SEQ ID 
NO: 16) by combining nucleotides from the two exons whose coordinates are: the 
complement of 164265-164325 (exon 1); and the complement of 155671-156992 
(exon 2). The sequence for this cDNA sequence is SEQ ID NO:3 (nucleotide 

30 sequence of mouse NgR3; AC021768). The translation of this cDNA provides the 
protein sequence of mouse NgR3 (SEQ ID NO:4). 
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We used the protein sequence of mouse NgR3 as a query sequence against the 
mouse EST database. One hit of high significance was found indicating that the NgR2 
mRNA is expressed in the heart. This EST (GB_EST20:AI428334) contains 463 
nucleotides, part of which correspond to amino acids 45-193 of mouse NgR3 (SEQ ID 
5 NO:4). 

Example 4: Similarity between the NgRs 

An alignment between NgRl and the two new receptors is shown in Fig. 
1A-1B. The similarities between these proteins include: 
10 (1) The SignalP program, which locates the signal sequence cleavage 

position, predicts a cleavage before the first conserved cysteine in all the proteins. 
Thus the mature protein in all cases will have a cysteine at the N-terminus. 

(2) All proteins contain eight Leucine Rich Repeats (LRR). LRRs are short 
sequence motifs present in a number of proteins with diverse functions and cellular 

15 locations. These repeats are usually involved in protein-protein interactions. Each 
LRR is composed of a beta-alpha unit. 

(3) All three proteins contain a leucine rich repeat N-terminal domain 
(LRRNT), in which four cysteines are conserved. LRRs are often flanked by cysteine 
rich domains at both their N and C termini. 

20 (4) All three proteins contain a LRR C-terminal domain (LRRCT). The 

LRRCTs of the three NgR proteins can be distinguished from those of other LRR 
containing proteins, by the pattern of typtophans and cysteines which are completely 
conserved in this domain. 

(5) All three proteins contain a conserved cysteine in the fourth LRR 
25 domain. 

(6) All three proteins contain a conserved potential glycosylation site in the 
eighth LRR domain. 

(7) NgR2 and NgR3 have a hydrophobic C-terminus, as does NgRl, an 
indication that they probably also undergo a modification similar to NgRl, where a 

30 GPI moiety is covalently linked to a C-terminal amino acid. This allows the protein to 
remain tethered to the cell. 
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Example 5: Preparation of Nogo Proteins 

A Nogo binding assay was developed which utilizes a method widely used in 
examining semaphorin and ephrin axonal guidance function (Flanagan & 
Vanderhaeghen (1998) Annu. Rev. Neurosci. 21,3 09-345; Takahashi etaL, (1999) 
5 Cell 99, 59-69). It involves fusing a secreted placental alkaline phosphatase (AP) 

moiety to the ligand in question to provide a biologically active receptor binding agent 
which can be detected with an extremely sensitive colorimetric assay. For Nogo, an 
expression vector is created encoding a signal peptide, aEDs6 tag for purification, AP, 
and the 66 amino acid active domain of Nogo. The fusion protein can be purified from 

10 the conditioned medium of transfected cells in milligram amounts. This protein is 
biologically active as a growth cone collapsing agent with an EC 50 of 1 nM. 

Alternatively, a glutathione-S-transferase Nogo (GST-Nogo) fusion protein 
may be prepared. For GST-Nogo, an expression vector (e.g., a pGEX vector) is 
created encoding a signal peptide, GST, and the 66 amino acid active domain of Nogo. 

15 GST-Nogo may be purified from the culture medium and used as a GST fusion 

protein, or GST may be cleaved from the Nogo portion of the fusion protein with an 
enzyme that recognizes the specific amino acid cleavage sit engineered between the 
GST portion and the Nogo portion of the fusion protein. Such sites are part of the 
commercially available GST vectors. The specific cleavage sites and enzymes may be 

20 used in accordance with the Manufacturers specifications. 

It has been found that AP-Nogo is actually slightly more potent than 
GST-Nogo, perhaps because the protein is synthesized in a eukaryotic rather than a 
prokaryotic cell. 

Binding of Nogo to immobilized NgR homologs may be performed in an 
25 ELIS A-type assay in which AP-Nogo is allowed to react with an immobilized receptor 
homolog. Specificity of binding may be demonstrated in a competitive binding assay 
using increasing amounts of GST-Nogo in the type of assay to show a decreasing 
amount of binding of AP-Nogo (as judged in the colorimetric assay). 



30 



Example 6: Transfected COS Cell binding Assays 

The homologs of the present invention may be used in transfection studies in 
COS cells to demonstrate binding of Nogo. Specifically, nucleotide sequences 
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encoding NgR2 and NgR3 may be transfected into COS cells using a suitable vector. 
Non-transfected COS-7 cells do not bind AP-Nogo. However, transfection of COS 
cells with nucleic acid sequences encoding NgRs will make them capable of binding 
Nogo. AP alone does not bind with any stable affinity to these transfected cells, 
5 indicating that any affinity of Nogo for NgR2 or NgR3 would be due to the 66 amino 
acids derived from Nogo. Furthemore, specific affinity of Nogo for the NgR2 or 
NgR3 proteins may be tested in displacement of AP-Nogo assays using GST-Nogo. 
NgR2 and/or NgR3 may also bind homologs of Nogo, which may also be tested using 
this assay. 

10 

Example 7: Expression of NgR in Human Cell Lines using Northern Blot and a 
Random-Primed Probe 

A Northern blot is purchased from a commercial source, or RNA samples from 
cells of interest are run on an agarose gel and blotted to a membrane using any of the 

1 5 well known techniques for Northern blotting. The blot is probed with a fragment of 
NgR2 (SEQ ID NO: 1) or NgR3 (SEQ ID NO:3). The probe is prepared from 50 ng 
of cDNA labeled by a random-primed method (Feinberg and Vogelstein (1983) Anal 
Biochem. 132, 6-13). Hybridization is carried out at 68 B C for 1 hour in 
ExpressHybTM solution (Clontech, Cat. No. 8015-1) followed by washing with 2X 

20 SSC/0.05% SDS at room temperature and two washes with 0. IX SSC/0. 1% SDS at 
50°C. Expression of NgR2 and/or NgR3 can be assessed by the presence of an 
appropriately sized band on the blot. 



25 



Example 8: Cloning of cDNA corresponding to NgRs 



To obtain the full-length clone corresponding to NgR2 from a cDNA library, 
the following method may be used. A cDNA library is generated using standard 
methods from a tissue known to contain NgR2. Such a tissue was identified in 
Example 2. 1 x 10 6 plaque forming units from the cDNA library may be screened in 
30 duplicate on OPTITRAN™ filters. The filters are hybridized with 32 P-labeled 

oligonucleotides that are generated from the ESTs corresponding to portions of NgR2. 
The hybridization reaction may consist of 400 mis plaque screen buffer (50mM Tris pH 
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7.5, IMNaCl, 0.1% Sodium pyrophosphate, 0.2% Polyvinylpryolidine and 0.2% 
Ficoll) containing 10% Dextran sulfate and 100(xg/ml tRNA and 80 pmol each 
32 P-labeled oligonucleotide at 65°C overnight. The filters are washed twice with 2X 
SSC/1%SDS and twice with IX SSC/1%SDS and exposed to film. Duplicate positives 
5 are purified. DNA from each of these clones is analyzed by restriction enzyme digest 
followed by agarose gel electrophoresis and Southern blotting. The filters are 
hybridized to the 32 P-labeled oligonucleotides used for the original hybridization to 
confirm that inserts hybridize to the probe: The insert is then sequenced to confirm that 
it represents the cDNA for NgR2. Similar methods may be used to generate a 
1 0 full-length clone corresponding to NgR3 . 

Alternatively, a full-length clone of NgR2 or NgR3 can be obtained by a 
person of ordinary skill in the art employing conventional PCR techniques. 

Example 9: Hybridization Analysis to demonstrate NgR expression in the brain 

1 5 The expression of NgR in mammals, such as the rat, may be investigated by in 

situ hybridization histochemistry. To investigate expression in the brain, for example, 
coronal and sagittal rat brain cryosections (20 [im thick) are prepared using a Reichert- 
Jung cryostat. Individual sections are thaw-mounted onto silanized, nuclease-free 
slides (CEL Associates, Inc., Houston, TX), and stored at -80°C. Sections are 

20 processed starting with post-fixation in cold 4% paraformaldehyde, rinsed in cold 

phosphate-buffered saline (PBS), acetylated using acetic anhydride in triethanolamine 
buffer, and dehydrated through a series of alcohol washes in 70%, 95%, and 100% 
alcohol at room temperature. Subsequently, sections are delipidated in chloroform, 
followed by rehydration through successive exposure to 100% and 95% alcohol at 

25 room temperature. Microscope slides containing processed cryosections are allowed 
to air dry prior to hybridization. Other tissues may be assayed in a similar fashion. 

A NgR-specific probe may be generated using PCR. Following PCR 
amplification, the fragment is digested with restriction enzymes and cloned into 
pBluescript II cleaved with the same enzymes. For production of a probe specific for 

30 the sense strand of NgR, a cloned NgR fragment cloned in pBluescript II may be 
linearized with a suitable restriction enzyme, which provides a substrate for labeled 
run-off transcripts (i.e., cRNA riboprobes) using the vector-borne T7 promoter and 



i 
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commercially available T7 RNA polymerase. A probe specific for the antisense strand 
of NgR may also be readily prepared using the NgR clone in pBluescript II by cleaving 
the recombinant plasmid with a suitable restriction enzyme to generate a linearized 
substrate for the production of labeled run-off cRNA transcripts using the T3 promoter 
5 and cognate polymerase. The riboprobes may be.labeled with [ 35 S]-UTP to yield a 
specific activity of about 0.40 x 10 6 cpm/pmol for antisense riboprobes and about 0.65 
x 10 6 cpm/pmol for sense-strand riboprobes. Each riboprobe may be subsequently 
denatured and added (2 pmol/ml) to hybridization buffer which contains 50% 
formamide, 10% dextran, 0.3 MNaCl, 10 mM Tris (pH 8.0), 1 mMEDTA, IX 

10 Denhardt's Solution, and 10 mM dithiothreitol. Microscope slides containing 

sequential brain cryosections may be independently exposed to 45 \il of hybridization 
solution per slide and silanized cover slips may be placed over the sections being 
exposed to hybridization solution. Sections are incubated overnight (15-18 hours) at 
52°C to allow hybridization to occur. Equivalent series of cryosections are then 

1 5 exposed to sense or antisense NgR-specific cRNA riboprobes. 

Following the hybridization period, coverslips are washed off the slides in IX 
SSC, followed by RNase A treatment involving the exposure of slides to 20 ^g/ml 
RNase A in a buffer containing 10 mM Tris-HCl (pH 7.4), 0.5 M EDTA, and 0.5 M 
NaCl for 45 minutes at 37°C. The cryosections are then subjected to three 

20 high-stringency washes in 0.1 X SSC at 52'C for 20 minutes each. Following the 

series of washes, cryosections are dehydrated by consecutive exposure to 70%, 95%, 
and 100% ammonium acetate in alcohol, followed by air drying and exposure to 
Kodak BioMax™ MR-1 film. After 13 days of exposure, the film is developed, and 
any significant hybridization signal is detected. Based on these results, slides containing 

25 tissue that hybridized, as shown by film autoradiograms, are coated with Kodak NTB- 
2 nuclear track emulsion and the slides are stored in the dark for 32 days, The slides 
are then developed and counterstained with hematoxylin. Emulsion-coated sections 
are analyzed microscopically to determine the specificity of labeling. The signal is 
determined to be specific if autoradiographic grains (generated by antisense probe 

30 hybridization) are clearly associated with cresyl violate-stained cell bodies. 

Autoradiographic grains found between cell bodies indicate non-specific binding of the 
probe. 
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In some cases, such as using a probe to detect a NgR homolog in a 
heterologous species, in order to achieve optimal hybridization, it may be necessary to 



activity have utility for treating neurological disorders. Some other diseases for which 
modulators of NgR may have utility include depression, anxiety, bipolar disease, 
epilepsy, neuritis, neurasthenia, neuropathy, neuroses, and the like. Use of NgR 
modulators, including NgR ligands and anti-NgR antibodies, to treat individuals having 
10 such disease states is intended as an aspect of the invention. 

Example 10: Northern Blot Analysis of NgR-RNA with a PCR-generated Probe 

Northern blot hybridizations may be performed to examine the expression of 
NgR mRNA. A clone containing at least a portion of the sequence of SEQ ID NO: 1 
1 5 may be used as a probe. Vector-specific primers are used in PCR to generate a 
hybridization probe fragment for 32 P-labeling. The PCR is performed as follows: 



5 



decrease the stringency conditions. Such conditions are well known to those of 
ordinary skill in the art and examples are provided above. 

Expression of NgR in the brain provides an indication that modulators of NgR 



20 



25 



Mix: l\xl 



2\xl 
2]xl 
lOjxl 



ljxl 
0.5^1 



NgR-containing plasmid 
fwd primer (10-50 pM) 
rev primer (10-50 pM) 

lOxPCR buffer (such as that provided with the enzyme, 
Amersham Pharmacia Biotech) 

lOmM dNTP (such as #1 969 064 from Boehringer Mannheim) 
Taq polymerase (such as #27-0799-62, Amersham Pharmacia 
Biotech) 



83.5^1 water 
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PCR is performed in a Thermocycler using the following program: 



94 9 C 



5min | 



94'C 
55°C 
72'C 



lmin 
lmin 
lmin 



30 cycles 



72'C 



lOmin 



10 



15 



20 



The PCR product may be purified using QIAquick PCR Purification Kit 
' (#28104) from Qiagen, and radictively labeled with 32 P-dCTP (#AA0005/250, 
Amersham Pharmacia Biotech)) may be done by random priming using "Ready-to-go 
DNA Labeling Beads" (#27-9240-01) from Amersham Pharmacia Biotech. 
Hybridization is carried out on Human Multiple Tissue Northern Blot from Clontech as 
described in manufacturer's protocol, or on a Northern Blot prepared by running RNA 
samples from cells of interest on an agarose gel and blotting to a membrane using any 
of the known Northern blotting protocols. After exposure overnight on Molecular 
Dynamics Phosphor Imager screen (#MD 146-8 14) bands of an appropriate size are 
visualized. 

Example 11: Recombinant Expression of NgR in Eukaryotic Host Cells 
A. Expression of NgR in Mammalian Cells 

To produce NgR protein, a NgR-encoding polynucleotide is expressed in a 
suitable host cell using a suitable expression vector and standard genetic engineering 
techniques. For example, a NgR-encoding sequence described in Table 4 is subctoned 
into the commercial expression vector pzeoSV2 (Invitrogen, San Diego, CA) and 
transfected into Chinese Hamster Ovary (CHO) cells using the transfection reagent 
FuGENE6™ (Boehringer-Mannheim) and the transfection protocol provided in the 
product insert. Other eukaryotic cell lines, including human embryonic kidney (HEK 
293) and COS cells, are suitable as well. Cells stably expressing NgR are selected by 
growth in the presence of 100 p.g/ml zeocin (Stratagene, LaJolla, CA). As an 
alternative to FuGENE6™, the expression vector may carry the gene for dihydrofolate 
reductase (dhfr) and selection of clones with methotrexate (MTX) drug pressure 



WO 02/29059 



PCT/US01/31488 



-118- 

allows for stable transforation of CHO cells. Optionally, NgR may be purified from 
the cells using standard chromatographic techniques. To facilitate purification, antisera 
is raised against one or more synthetic peptide sequences that correspond to portions 
of the NgR amino acid sequence, and the antisera is used to affinity purify Nogo-R. 
5 The NgR also may be expressed in-frame with a tag sequence (e.g. , polyhistidine, 
hemaglutinin, FLAG) to facilitate purification. Moreover, it will be appreciated that 
many of the uses for NgR polypeptides, such as assays described below, do not require 
purification of NgR from the host cell. 

10 B. Expression of NgR in CHO cells 

For expression of NgR in Chinese hamster ovary (CHO) cells, a plasmid 
bearing the relevant NgR coding sequence is prepared, using a vector which also bears 
the selectable marker dihydrofolate reductase (DHFR). The plasmid is transfected into 
CHO cells. Selection under MTX drug pressure allows for preparation of stable 
15 transformants of a NgR (NgR2 or NgR3) in an expression plasmid carrying a 
selectable marker such as DHFR. 

C. Expression of NgR in 293 cells 

For expression of NgR in mammalian cells 293 (transformed human, primary 
20 embryonic kidney cells), a plasmid bearing the relevant NgR coding sequence is 
prepared, using vector pSecTag2A (Invitrogen). Vector pSecTag2A contains the 
murine IgK chain leader sequence for secretion, the c-myc epitope for detection of the 
recombinant protein with the anti-myc antibody, a C-terminal polyhistidine for 
purification with nickel chelate chromatography, and a Zeocin resistant gene for 
25 selection of stable transfectants. The forward primer for amplification of this NgR 
cDNA is determined by routine procedures and preferably contains a 5' extension of 
nucleotides to introduce the HindlH cloning site and nucleotides matching the NgR 
sequence. The reverse primer is also determined by routine procedures and preferably 
contains a 5 ! extension of nucleotides to introduce an Xhol restriction site for cloning 
30 and nucleotides corresponding to the reverse complement of the NgR sequence. The 
PCR conditions are 55°C as the annealing temperature. The PCR product is gel 
purified and cloned into the HindDI-Xhol sites of the vector. 
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The DNA is purified using Qiagen chromatography columns and transfected 
into 293 cells using DOTAP™ transfection media (Boehringer Mannheim, 
Indianapolis, IN). Transiently transfected cells are tested for expression after 24 hours 
of transfection, using western blots probed with anti-His and anti-NgR peptide 
5 antibodies. Permanently transfected cells are selected with Zeocin and propagated. 
Production of the recombinant protein is detected from both cells and media by 
Western blots probed with anti-His, anti-Myc or anti-NgR peptide antibodies. 

D. Transient Expression of Nogo-R in COS cells 

10 For expression of the NgR in COS7 cells, a polynucleotide molecule having a 

nucleotide sequence of SEQ ID NO: 1, for example, can be cloned into vector p3-CI. 
This vector is a pUC18-derived plasmid that contains the HCMV (human 
cytomegalovirus) promoter-intron located upstream from the bGH (bovine growth 
hormone) polyadenylation sequence and a multiple cloning site. 

15 The forward primer is determined by routine procedures and preferably 

contains a 5 f extension which introduces an Xbal restriction site for cloning, followed 
by nucleotides which correspond to a nucleotide sequence of SEQ ID NO: 1 . The 
reverse primer is also determined by routine procedures and preferably contains 5- 
extension of nucleotides which introduces a Sail cloning site followed by nucleotides 

20 which correspond to the reverse complement of a nucleotide sequence of SEQ ID 
NO:l. 

The PCR consists of an initial denaturation step of 5 min at 95 °C, 30 cycles of 
30 sec denaturation at 95°C, 30 sec annealing at 58°C and 30 sec extension at 72°C, 
followed by 5 min extension at 72°C. The PCR product is gel purified and ligated into 
25 the Xbal and Sail sites of vector p3-CI. This construct is transformed into E. coli cells 
for amplification and DNA purification. The DNA is purified with Qiagen 
chromatography columns and transfected into COS 7 cells using Lipofectamine™ 
reagent from BRL, following the manufacturer's protocols. Forty-eight and 72 hours 
. after transfection, the media and the cells are tested for recombinant protein 
30 expression. 

NgR expressed from a COS cell culture can be purified by concentrating the 
cell-growth media to about 10 mg of protein/ml, and purifying the protein by, for 
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example, chromatography. Purified NgR is concentrated to 0.5 mg/ml in an Amicon 
concentrator fitted with a YM-10 membrane and stored at -80 8 C. NgR3 may also be 
expressed using this method and the nucleotide sequence of SEQ ID NO:3 or SEQ ID 
NO:13. 

5 

E. Expression of NgR in Insect Cells 

For expression of NgR in a baculovirus system, a polynucleotide molecule 
having a nucleotide sequence of SEQ ID NO:l, 3 or 13 can be amplified by PCR. The 
forward primer is determined by routine procedures and preferably contains a 5' 

10 extension which adds the Ndel cloning site, followed by nucleotides which correspond 
to a nucleotide sequence of SEQ ID NO: 1 (or SEQ ID NO:3 or SEQ ID NO: 13, 
respectively). The reverse primer is also determined by routine procedures and 
preferably contains a 5' extension which introduces the Kpnl cloning site, followed by 
nucleotides which correspond to the reverse complement of a nucleotide sequence of 

15 SEQ ID NO: 1 (or SEQ ID NO:3 or SEQ ID NO: 13, respectively). 

The PCR product is gel purified, digested with Ndel and Kpnl, and cloned into 
the corresponding sites of vector pACHTL-A (Pharmingen, San Diego, CA). The 
pAcHTL expression vector contains the strong polyhedrin promoter of the 
Autographa californica nuclear polyhedrosis virus (AcMNPV), and a 6XHis tag 

20 upstream from the multiple cloning site. A protein kinase site for phosphorylation and 
a thrombin site for excision of the recombinant protein precede the multiple cloning 
site is also present. Of course, many other baculovirus vectors could be used in place 
of pAcHTL-A, such as pAc373, pVL941 and pAcIMl. Other suitable vectors for the 
expression of NgR polypeptides can be used, provided that the vector construct 

25 includes appropriately located signals for transcription, translation, and trafficking, 

such as an in-frame AUG and a signal peptide, as required. Such vectors are described 
inLuckowefa/., Virology 170:31-39, among others. 

The virus is grown and isolated using standard baculovirus expression methods, 
such as those described in Summers et al (1 987) A MANUAL OF METHODS FOR 

30 Baculovirus Vectors and Insect Cell Culture Procedures, Texas Agricultural 
Experimental Station Bulletin No. 1555. 
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In a preferred embodiment, p AcHLT-A containing NgR gene is introduced into 
baculovirus using the "BaculoGold™" transfection kit (Pharmingen, San Diego, CA) 
using methods established by the manufacturer. Individual virus isolates are analyzed 
for protein production by radiolabeling infected cells with 35 S-methionine at 24 hours 
5 post infection. Infected cells are harvested at 48 hours post infection, and the labeled 
proteins are visualized by SDS-PAGE. Viruses exhibiting high expression levels can 
be isolated and used for scaled up expression. 

For expression of a NgR polypeptide in a Sf9 cells, a polynucleotide molecule 
having the nucleotide sequence of SEQ ID NO: 1 (or SEQ ID NO:3 or SEQ ID 

10 NO: 13) can be amplified by PCR using the primers and methods described above for 
baculovirus expression. The NgR cDNA is cloned into vector pAcHLT-A 
(Pharmingen) for expression in Sf9 insect. The insert is cloned into the Ndel and Kpnl 
sites, after elimination of an internal Ndel site (using the same primers described above 
for expression in baculovirus). DNA is purified with Qiagen chromatography columns 

1 5 and expressed in Sf9 cells. Preliminary Western blot experiments from non-purified 
plaques are tested for the presence of the recombinant protein of the expected size 
which reacted with the NgR-specific antibody. These results are confirmed after 
further purification and expression optimization in HiG5 cells. 

20 F. Expression of soluble forms of NgR2 and NgR3 as NgR-Ig fusion 

proteins. 

To generate aNgR2-Ig fusion protein, standard methods may be used as 
described in the literature (e.g. Sanicola et al (1997) Proc. Natl Acad. Set USA. 94, 
6238-6243). For example, a DNA fragment encoding NgR2 without the sequence 

25 encoding the hydrophobic C-terminus (GPI anchor signal) may be ligated to a DNA 
fragment encoding the Fc domain of IgGl (which may be human IgGl), and the 
chimeric fragment may be cloned into an expression vector to generate a plasmid. The 
plasmid may then be transfected into Chinese hamster ovary cells to generate a stable 
cell line producing the fusion protein. The fusion protein is then purified from 

30 conditioned media using standard methods. For example, clarified conditioned media 
from the cell line may be loaded by gravity directly onto Protein A Sepharose. The 
column may then be washed with five column volumes each of PBS, PBS containing 
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0.5 MNaCl, and 25 mM sodium phosphate, 100 mMNaCl (pH 5.0). The bound 
protein may then be eluted with 25 mM NaH 2 P0 4 , 100 mM NaCl (pH 2.8) and 
immediately neutralized with 1/10 fraction volume of 0.5 M ^HPC^ (pH 8.6). 
Similar methods may be used to generate a NgR3-Ig fusion protein. 

5 

Example 12: Interaction Trap/Two-Hybrid System 

In order to assay for NgR-interacting proteins, the interaction trap/two-hybrid 
library screening method can be used. This assay was first described in Fields et al 
(1989) 'Nature 340, 245, which is incorporated herein by reference in its entirety. A 

1 0 protocol is published in CURRENT PROTOCOLS IN MOLECULAR BIOLOGY 1 999, John 
Wiley & Sons, NY and Ausubel, F. M. et al 1992, SHORT PROTOCOLS IN MOLECULAR 
Biology, fourth edition, Greene and Wiley-interscience, NY, which is incorporated 
herein by reference in its entirety. Kits are available from Clontech, Palo Alto, CA 
(Matchmaker Two-Hybrid System 3). 

1 5 A fusion of the nucleotide sequences encoding all or partial NgR and the yeast 

transcription factor GAL4 DNA-binding domain (DNA-BD) is constructed in an 
appropriate plasmid (i.e., pGBKT7) using standard subcloning techniques. Similarly, a 
GAL4 active domain (AD) fusion library is constructed in a second plasmid (i.e. , 
pGADT7) from cDNA of potential NgR-binding proteins (for protocols on forming 

20 cDNA libraries, see Sambrook et al 1989, MOLECULAR CLONING: A LABORATORY 
MANUAL, second edition, Cold Spring Harbor Laboratory Press, Cold Spring Harbor, 
NY), which is incorporated herein by reference in its entirety. The DNA-BD/NgR 
fusion construct is verified by sequencing, and tested for autonomous reporter gene 
activation and cell toxicity, both of which would prevent a successful two-hybrid 

25 analysis. Similar controls are performed with the AD/library fusion construct to ensure 
expression in host cells and lack of transcriptional activity. Yeast cells are transformed 
(ca. 105 transformants/mg DNA) with both the NgR and library fusion plasmids 
according to standard procedure (Ausubel, et al, 1992, Short PROTOCOLS IN 
Molecular Biology, fourth edition, Greene and Wiley-interscience, NY, which is 

30 incorporated herein by reference in its entirety). In vivo binding of DNA-BD/NgR 
with AD/library proteins results in transcription of specific yeast plasmid reporter 
genes (i.e., lacZ t HIS3, ADE2, LEU2). Yeast cells are plated on nutrient-deficient 
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media to screen for expression of reporter genes. Colonies are dually assayed for 
p-galactosidase activity upon growth in Xgal 

(5-bromo-4-chloro-3-indoIyl-b-D-galactoside) supplemented media (filter assay for 
b-galactosidase activity is described in Breeden et al, (1985) Cold Spring Harb. Symp. 
5 Quant Biol, 50, 643, which is incorporated herein by reference in its entirety). 

Positive AD-library plasmids are rescued from transformants and reintroduced into the 
original yeast strain as well as other strains containing unrelated DNA-BD fusion 
proteins to confirm specific NgR/library protein interactions. Insert DNA is sequenced 
to verify the presence of an open reading frame fused to GAL4 AD and to determine 
1 0 the identity of the NgR-binding protein. 

Example 13: Antibodies to Nogo-R 

Standard techniques are employed to generate polyclonal or monoclonal 
antibodies to the NgR receptor, and to generate useful antigen-binding fragments 

15 thereof or variants thereof, including "humanized" variants. Such protocols can be 
found, for example, in Sambrook et al (1989), above, and Harlow et al (Eds.), 
Antibodies A Laboratory manual; Cold Spring Harbor Laboratory Press, Cold 
Spring Harbor, NY (1988). In one embodiment, recombinant NgR polypeptides (or 
cells or cell membranes containing such polypeptides) are used as antigen to generate 

20 the antibodies. In another embodiment, one or more peptides having amino acid 

sequences corresponding to an immunogenic portion of NgR {e.g., 6, 7, 8, 9, 10, 11, 
12, 13, 14, 15, 16, 17, 18, 19, 20, or more amino acids) are used as antigen. Peptides 
corresponding to extracellular portions of Nogo-R, especially hydrophilic extracellular 
portions, are preferred. The antigen may be mixed with an adjuvant or linked to a 

25 hapten to increase antibody production. 

A. Polyclonal or Monoclonal antibodies 

As one exemplary protocol, recombinant NgR or a synthetic fragment thereof 
is used to immunize a mouse for generation of monoclonal antibodies (or larger 
30 mammal, such as a rabbit, for polyclonal antibodies). To increase antigenicity, 

peptides are conjugated to Keyhole Limpet Hemocyanin (Pierce), according to the 
manufacturer's recommendations. For an initial injection, the antigen is emulsified with 
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Freund's Complete Adjuvant and injected subcutaneously. At intervals of two to three 
weeks, additional aliquots of NgR antigen are emulsified with Freund's Incomplete 
Adjuvant and injected subcutaneously. Prior to the final booster injection, a serum 
sample is taken from the immunized mice and assayed by western blot to confirm the 
5 presence of antibodies that immunoreact with NgR. Serum from the immunized 

animals may be used as polyclonal antisera or used to isolate polyclonal antibodies that 
recognize NgR. Alternatively, the mice are sacrificed and their spleen removed for 
generation of monoclonal antibodies. 

To generate monoclonal antibodies, the spleens are placed in 10 ml serum-free 

10 RPMI 1640, and single cell suspensions are formed by grinding the spleens in serum- 
free RPMI 1640, supplemented with 2 mM L-glutamine, 1 mM sodium pyruvate, 100 
units/ml penicillin, and 100 [ig/ml streptomycin (RPMI) (Gibco, Canada). The cell 
suspensions are filtered and washed by centrifugation and resuspended in serum-free 
RPMI. Thymocytes taken from three naive Balb/c mice are prepared in a similar 

15 manner and used as a Feeder Layer. NS-1 myeloma cells, kept in log phase in RPMI 
with 10% fetal bovine serum (FBS) (Hyclone Laboratories, Inc., Logan, UT) for three 
days prior to fusion, are centrifiiged and washed as well. 

To produce hybridoma fusions, spleen cells from the immunized mice are 
combined with NS-1 cells and centrifiiged, and the supernatant is aspirated. The cell 

20 pellet is dislodged by tapping the tube, and 2 ml of 37°C PEG 1500 (50% in 75 mM 
HEPES, pH 8.0) (Boehringer-Mannheim) is stirred into the pellet, followed by the 
addition of serum-free RPMI. Thereafter, the cells a^e centrifiiged, resuspended in 
RPMI containing 15% FBS, 100 yM sodium hypoxanthine, 0.4 \M aminopterin, 16 
\iM thymidine (HAT) (Gibco), 25 units/ml IL-6 (Boehringer-Mannheim) and 1.5 x 10 6 

25 thymocytes/ml, and plated into 10 Corning flat-bottom 96-well tissue culture plates 
(Corning, Corning, NY). 

On days 2, 4, and 6 after the fusion, 100 \il of medium is removed from the 
wells of the fusion plates and replaced with fresh medium. On day 8, the fusions are 
screened by ELISA, testing for the presence of mouse IgG that binds to NgR. 

30 Selected fusion wells are further cloned by dilution until monoclonal cultures 
producing anti-NgR antibodies are obtained. 
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B. Humanization of anti-NgR monoclonal antibodies 

The expression pattern of NgR as reported herein and the potential of NgRs as 
targets for therapeutic intervention suggest therapeutic indications for NgR inhibitors 
(antagonists). NgR-neutralizing antibodies comprise one class of therapeutics useful as 
5 NgR antagonists. Following are protocols to improve the utility of anti-NgR 

monoclonal antibodies as therapeutics in humans by "humanizing" the monoclonal 
antibodies to improve their serum half-life and render them less immunogenic in human 
hosts (i.e., to prevent human antibody response to non-human anti-NgR antibodies). 

The principles of humanization have been described in the literature and are 
10 facilitated by the modular arrangement of antibody proteins. To minimize the 
possibility of binding complement, a humanized antibody of the IgG4 isotype is 
preferred. 

For example, a level of humanization is achieved by generating chimeric 
antibodies comprising the variable domains of non-human antibody proteins of interest 

1 5 with the constant domains of human antibody molecules. (See, e.g. , Morrison et al , 
(1989) Adv. Immunol., 44, 65-92). The variable domains of NgR-neutralizing anti- 
NgR antibodies are cloned from the genomic DNA of a B-cell hybridoma or from 
cDNA generated from mRNA isolated from the hybridoma of interest. The V region 
gene fragments are linked to exons encoding human antibody constant domains, and 

20 the resultant construct is expressed in suitable mammalian host cells (e.g., myeloma or 
CHO cells). 

To achieve an even greater level of humanization, only those portions of the 
variable region gene fragments that encode antigen-binding complementarity 
determining regions ("CDR") of the non-human monoclonal antibody genes are cloned 

25 into human antibody sequences. (See, e.g., Jones et al, (1986) Nature 321, 522-525; 
Riechmann et al, (1988) Nature 332, 323-327; Verhoeyen et al, (1988) Science 239, 
1534-1536 ; and Tempest et al, (1991) Bio/Technology 9, 266-271). If necessary, the 
13-sheet framework of the human antibody surrounding the CDR3 regions also is 
modified to more closely mirror the three dimensional structure of the antigen-binding 

30 domain of the original monoclonal antibody. (See Kettleborough et al, (1991) Protein 
Engin. 4, 773-783; and Foote et al, (1992) J. Mol. Biol 224, 487-499). 
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In an alternative approach, the surface of a non-human monoclonal antibody of 
interest is humanized by altering selected surface residues of the non-human antibody, 
e.g., by site-directed mutagenesis, while retaining all of the interior and contacting 
residues of the non-human antibody. See Padlan (1991) Mol Immunol 28, 489-498 . 
5 The foregoing approaches are employed using NgR-neutralizing anti-NgR 

monoclonal antibodies and the hybridomas that produce them to generate humanized 
NgR-neutralizing antibodies useful as therapeutics to treat or palliate conditions 
wherein NgR expression or ligand-mediated NgR signaling is detrimental. 

10 C. Human NgR-Neutralizing Antibodies from Phage Display 

Human NgR-neutralizing antibodies are generated by phage display techniques 
such as those described in Aujame etal. (1997) Human Antibodies 8, 155-168; 
Hoogenbpom (1997) TIBTECH 15, 62-70; and Rader et al (1997), Curr. Opin. 
Biotechnol. 8, 503-508, all of which are incorporated by reference. For example, 

15 antibody variable regions in the form of Fab fragments or linked single chain Fv 

fragments are fused to the amino terminus of filamentous phage minor coat protein 
pin. Expression of the fusion protein and incorporation thereof into the mature phage 
coat results in phage particles that present an antibody on their surface and contain the 
genetic material encoding the antibody. A phage library comprising such constructs is 

20 expressed in bacteria, and the library is screened for NgR-specific phage-antibodies 
using labeled or immobilized NgR as antigen-probe. 

D. Human NgR-neutralizing antibodies from transgenic mice 

Human NgR-neutralizing antibodies are generated in transgenic mice essentially 
25 as described in Bruggemann et al (1996) Immunol Today 17, 391-397 and 

Bruggemanne/a/. (1997) Curr. Opin. Biotechnol 8, 455-458. Transgenic mice 
carrying human V-gene segments in germline configuration and that express these 
transgenes in their lymphoid tissue are immunized with a NgR composition using 
conventional immunization protocols, hybridomas are generated using B cells from the 
30 immunized mice using conventional protocols and screened to identify hybridomas 
secreting anti-NgR human antibodies (e.g., as described above). 
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Example 14: Assays to Identify Modulators of NgR Activity 

Set forth below are several nonlimiting assays for identifying modulators 
(agonists and antagonists) of NgR activity. Among the modulators that can be 
identified by these assays are natural ligand compounds of the receptor; synthetic 
5 analogs and derivatives of natural ligands; antibodies, antibody fragments, and/or 
antibody-like compounds derived from natural antibodies or from antibody-like 
combinatorial libraries; and/or synthetic compounds identified by high-throughput 
screening of libraries; and the like. All modulators that bind NgR are useful for 
identifying NgR in tissue samples (e.g., for diagnostic purposes, pathological purposes, 
10 and the like). Agonist and antagonist modulators are useful for up-regulating and 
down-regulating NgR activity, respectively, to treat disease states characterized by 
abnormal levels of NgR activity. The assays may be performed using single putative 
modulators, and/or may be performed using a known agonist in combination with 
candidate antagonists (or visa versa). 

15 

A. cAMP Assays 

In one type of assay, levels of cyclic adenosine monophosphate (cAMP) are 
measured in NgR-transfected cells that have been exposed to candidate modulator 
compounds. Protocols for cAMP assays have been described in the literature. (See, 

20 e.g., Sutherland et aL, (1968) Circulation 37, 279; Frandsen et aL, (1976) Life 

Sciences 18, 529-541; Dooley et aL, (1997) J. Pharmacol Exp. Therap. 283, 735-41; 
and George et aL, (1997) J. BiomoL Screening 2, 235-40). An exemplary protocol for 
such an assay, using an Adenylyl Cyclase Activation FlashPlate® Assay from NEN™ 
Life Science Products, is set forth below. 

25 Briefly, the NgR coding sequence (e.g., a cDNA or intronless genomic DNA) 

is subcloned into a commercial expression vector, such as pzeoSV2 (Invitrogen), and 
transiently transfected into Chinese Hamster Ovary (CHO) cells using known methods, 
such as the transfection protocol provided by Boehringer-Mannheim when supplying 
the FuGENE 6 transfection reagent. Transfected CHO cells are seeded into 96-well 

30 microplates from the FlashPlate® assay kit, which are coated with solid scintillant to 
which antisera to cAMP has been bound. For a control, some wells are seeded with 
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wild type (untransfected) CHO cells. Other wells in the plate receive various amounts 
of a cAMP standard solution for use in creating a standard curve. 

One or more test compounds (i.e., candidate modulators) are added to the cells 
in each well, with water and/or compound-free medium/diluent serving as a control or 
5 controls. After treatment, cAMP is allowed to accumulate in the cells for exactly 15 
minutes at room temperature. The assay is terminated by the addition of lysis buffer 
containing [ 125 I]-labeled cAMP, and the plate is counted using a Packard Topcount™ 
96-well microplate scintillation counter. Unlabeled cAMP from the lysed cells (or 
from standards) and fixed amounts of [ l25 I]-cAMP compete for antibody bound to the 

10 plate. A standard curve is constructed, and cAMP values for the unknowns are 

obtained by interpolation. Changes in intracellular cAMP levels of cells in response to 
exposure to a test compound are indicative of NgR modulating activity. Modulators 
that act as agonists of receptors which couple to the G s subtype of G proteins will 
stimulate production of cAMP, leading to a measurable 3-10 fold increase in cAMP 

1 5 levels. Agonists of receptors which couple to the subtype of G proteins will inhibit 
forskolin-stimulated cAMP production, leading to a measurable decrease in cAMP 
levels of 50-100%. Modulators that act as inverse agonists will reverse these effects at 
receptors that are either constitutive!/ active or activated by known agonists. 

20 B. Aequorin Assays 

In another assay, cells (e.g., CHO cells) are transiently co-transfected with both 
a NgR expression construct and a construct that encodes the photoprotein apoaquorin. 
In the presence of the cofactor coelenterazine, apoaquorin will emit a measurable 
luminescence that is proportional to the amount of intracellular (cytoplasmic) free 

25 calcium, (See generally, Cobbold, et al "Aequorin measurements of cytoplasmic free 
calcium," In: McCormack J.G. and Cobbold P.H., eds., CELLULAR CALCIUM: A 
Practical Approach. Oxford:IRL Press (1991); Stables et al, (1997) Anal. 
Biochem. 252, 1 1 5-26; and Haugland, HANDBOOK OF FLUORESCENT PROBES AND 
Research Chemicals. Sixth edition. Molecular Probes, Eugene, OR (1996)). 

30 In one exemplary assay, NgR is subcloned into the commercial expression 

vector pzeoSV2 (Invitrogen) and transiently co-transfected along with a construct that 
encodes the photoprotein apoaquorin (Molecular Probes, Eugene, OR) into CHO cells 
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using the transfection reagent FuGENE 6 (Boehringer-Mannheim) and the transfection 
protocol provided in the product insert. 

The cells are cultured for 24 hours at 37°C in MEM (Gibco/BRL, 
Gaithersburg, MD) supplemented with 10% fetal bovine serum, 2 mM glutamine, 10 
5 U/ml penicillin and 10 ng/ml streptomycin, at which time the medium is changed to 
serum-free MEM containing 5 (iM coelenterazine (Molecular Probes, Eugene, OR). 
Culturing is then continued for two additional hours at 37°C. Subsequently, cells are 
detached from the plate using VERSEN (Gibco/BRL), washed, and resuspended at 
200,000 cells/ml in serum-free MEM. 

1 0 Dilutions of candidate NgR modulator compounds are prepared in serum-free 

MEM and dispensed into wells of an opaque 96-well assay plate at 50 jil/well. Plates 
are then loaded onto an MLX microtiter plate luminometer (Dynex Technologies, Inc., 
Chantilly, VA). The instrument is programmed to dispense 50 \il cell suspensions into 
each well, one well at a time, and immediately read luminescence for 15 seconds. 

15 Dose-response curves for the candidate modulators are constructed using the area 
under the curve for each light signal peak. Data are analyzed with SlideWrite, using 
the equation for a one-site ligand, and EC 50 values are obtained. Changes in 
luminescence caused by the compounds are considered indicative of modulatory 
activity. Modulators that act as agonists at receptors which couple to the G q subtype 

20 of G proteins give an increase in luminescence of up to 100 fold. Modulators that act 
as inverse agonists will reverse this effect at receptors that are either constitutively 
active or activated by known agonists. 

C. Luciferase Reporter Gene Assay 

25 The photoprotein luciferase provides another useful tool for assaying for 

modulators of NgR activity. Cells (e.g., CHO cells or COS 7 cells) are transiently co- 
transfected with both a NgR expression construct {e.g., NgR in pzeoSV2) and a 
reporter construct which includes a gene for the luciferase protein downstream from a 
transcription factor binding site, such as the cAMP-response element (CRE), AP-1, or 

30 NF-kappa B. Expression levels of luciferase reflect the activation status of the 

signaling events. (See generally, George et al (1997) J. Biomol Screening 2, 235- 
240; and Stratowa a/. (1995) Curr. Opin. Biotechnol 6, 574-581). Luciferase 
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activity may be quantitatively measured using, e.g., luciferase assay reagents that are 
commercially available from Promega (Madison, WI). 

In one exemplary assay, CHO cells are plated in 24-well culture dishes at a 
density of 100,000 cells/well one day prior to transfection and cultured at 37°C in 
5 MEM (Gibco/BRL) supplemented with 10% fetal bovine serum, 2 rnM glutamine, 10 
U/ml penicillin and 10 p-g/ml streptomycin. Cells are transiently co-transfected with 
both a NgR expression construct and a reporter construct containing the luciferase 
gene. The reporter plasmids CRE-luciferase, AP-1 -luciferase and NF-kappaB- 
luciferase may be purchased from Stratagene (Legally, CA). Transfections are 

10 performed using the FuGENE 6 transfection reagent (Boehringer-Mannheim) 

according to the supplier's instructions. Cells transfected with the reporter construct 
alone are used as a control. Twenty-four hours after transfection, cells are washed 
once with PBS pre-warmed to 37 B C. Serum-free MEM is then added to the cells 
either alone (control) or with one or more candidate modulators and the cells are 

15 incubated at 37°C for five hours. Thereafter, cells are washed once with ice-cold PBS 
and lysed by the addition of 100 \i\ of lysis buffer per well from the luciferase assay kit 
supplied by Promega. After incubation for 15 minutes at room temperature, 15 jil of 
the lysate is mixed with 50 |xl of substrate solution (Promega) in an opaque-white, 
96-well plate, and the luminescence is read immediately on a Wallace model 1450 

20 MicroBeta scintillation and luminescence counter (Wallace Instruments, Gaithersburg, 
MD). 

Differences in luminescence in the presence versus the absence of a candidate 
modulator compound are indicative of modulatory activity. Receptors that are either 
constitutively active or activated by agonists typically give a 3-20-fold stimulation of 
25 luminescence compared to cells transfected with the reporter gene alone. Modulators 
that act as inverse agonists will reverse this effect. 



D. Intracellular calcium measurement using FLJPR 

Changes in intracellular calcium levels are another recognized indicator of 
30 receptor activity, and such assays can be employed to screen for modulators of NgR 
activity. For example, CHO cells stably transfected with a NgR expression vector are 
plated at a density of 4 x 10 4 cells/well in Packard black-walled, 96-well plates 
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specially designed to discriminate fluorescence signals emanating from the various 
wells on the plate. The cells are incubated for 60 minutes at 37°C in modified 
Dulbecco's PBS (D-PBS) containing 36 mg/L pyruvate and 1 g/L glucose with the 
addition of 1% fetal bovine serum and one of four calcium indicator dyes (Fluo-3™ 
5 AM, Fluo-4™ AM, Calcium Green™-1 AM, or Oregon Green™ 488 BAPTA-1 AM), 
each at a concentration of 4 pM. Plates are washed once with modified D-PBS 
without 1% fetal bovine serum and incubated for 10 minutes at 37*C to remove 
residual dye from the cellular membrane. In addition, a series of washes with modified 
D-PBS without 1% fetal bovine serum is performed immediately prior to activation of 

1 0 the caScium response. 

A calcium response is initiated by the addition of one or more candidate 
receptor agonist compounds, calcium ionophore A23187 (10 \iM; positive control), or 
ATP (4 }iM; positive control). Fluorescence is measured by Molecular Device's 
FLIPR with an argon laser (excitation at 488 nm). (See, e.g., Kuntzweiler et al 

1 5 (1 998) Drug Dev. Res. 44, 14-20). The F-stop for the detector camera is set at 2. S and 
the length of exposure is 0.4 milliseconds. Basal fluorescence of cells is measured for 
20 seconds prior to addition of candidate agonist, ATP, or A23 187, and the basal 
fluorescence level is subtracted from the response signal. The calcium signal is 
measured for approximately 200 seconds, taking readings every two seconds. Calcium 

20 ionophore A23 187 and ATP increase the calcium signal 200% above baseline levels. 
In general, activated NgRs increase the calcium signal at least about 10-15% above 
baseline signal. 

E. [ 35 S]GTPyS Binding Assay 
25 It is also possible to evaluate whether NgR signals through a G 

protein-mediated pathway. Because G protein-coupled receptors signal through 
intracellular G proteins whose activity involves GTP binding and hydrolysis to yield 

35 

bound GDP, measurement of binding of the non-hydrolyzable GTP analog [ S]- 
GTPyS in the presence and absence of candidate modulators provides another assay 
30 for modulator activity. (See, e.g., Kowal et al, (1998) Neuropharmacology 37, 179- 
187.). 
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In one exemplary assay, cells stably transfected with a NgR expression vector 
are grown in 10 cm tissue culture dishes to subconfluence, rinsed once with 5 ml of 
ice-cold Ca 2+ /Mg 2+ -free phosphate-buffered saline, and scraped into 5 ml of the same 
buffer. Cells are pelleted by centrifugation (500 x g, 5 minutes), resuspended in TEE 
5 buffer (25 mM Tris, pH 7. 5 , 5 mM EDTA, 5 mM EGTA), and frozen in liquid 

nitrogen. After thawing, the cells are homogenized using a Dounce homogenizer (1 ml 
TEE per plate of cells), and centrifiiged at 1,000 xg for 5 minutes to remove nuclei 
and unbroken cells. 

The homogenate supernatant is centrifiiged at 20,000 x g for 20 minutes to 

10 isolate the membrane fraction, and the membrane pellet is washed once with TEE and 
resuspended in binding buffer (20 mM HEPES, pH 7.5, 150 mM NaCl, 10 mM MgCl 2 , 
1 mM EDTA). The resuspended membranes can be frozen in liquid nitrogen and 
stored at -70 8 C until use. 

Aliquots of cell membranes prepared as described above and stored at -70°C 

15 are thawed, homogenized, and diluted into buffer containing 20 mM HEPES, 10 mM 
MgCl 2 , 1 mM EDTA, 120 mMNaCl, 10 pM GDP, and 0.2 mM ascorbate, at a 
concentration of 10-50 ng/ml. In a final volume of 90 homogenates are incubated 
with varying concentrations of candidate modulator compounds or 100 \lM GTP for 
30 minutes at 30°C and then placed on ice. To each sample, 10 \il guanosine 

20 5'-0-(3[ 35 S]thio) triphosphate (NEN, 1200 Ci/mmol; [ 35 S]-GTPyS), was added to a 
final concentration of 100-200 pM. Samples are incubated at 30°C for an additional 30 
minutes, 1 ml of 1 0 mM HEPES, pH 7.4, 10 mM MgCl 2 , at 4'C is added and the 
reaction is stopped by filtration. 

Samples are filtered over Whatman GF/B filters and the filters are washed with 

25 20 ml ice-cold 10 mM HEPES, pH 7.4, 10 mM MgCl 2 . Filters are counted by liquid 
scintillation spectroscopy. Nonspecific binding of [ 35 S]-GTPyS is measured in the 
presence of 100 \iM GTP and subtracted from the total. Compounds are selected that 
modulate the amount of [ 35 S]-GTPyS binding in the cells, compared to untransfected 
control cells. Activation of receptors by agonists gives up to a five-fold increase in 

30 [ 35 S]-GTPyS binding. This response is blocked by antagonists. 



F. [ 3 H] Arachidonic Acid Release 



wo 
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The activation of NgRs may also potentiate arachidonic acid release in cells, 
providing yet another useful assay for modulators of NgR activity. (See, e.g., 
Kanterman et al y (1991) MoL Pharmacol 39,364-369.) For example, CHO cells that 
are stably transfected with a NgR expression vector are plated in 24- well plates at a 
density of 15,000 cells/well and grown in MEM medium supplemented with 10% fetal 
bovine serum, 2 mM glutamine, 10 U/ml penicillin and 10 jig/ml streptomycin for 48 
hours at 37°C before use. Cells of each well are labeled by incubation with 
[^-arachidonic acid (Amersham Corp., 210 Ci/mmol) at 0.5 jxCi/ml in 1 ml MEM 
supplemented with 10 mM HEPES, pH 7.5, and 0.5% fatty-acid-free bovine serum 
albumin for 2 hours at 37'C. The cells are then washed twice with 1 ml of the same 
buffer. 

Candidate modulator compounds are added in 1 ml of the same buffer, either 
alone or with 10 jiM ATP and the cells are incubated at 37'C for 30 minutes. Buffer 
alone and mock-transfected cells are used as controls. Samples (0.5 ml) from each 
well are counted by liquid scintillation spectroscopy. Agonists which activate the 
receptor will lead to potentiation of the ATP-stimulated release of [ 3 H]-arachidonic 
acid. This potentiation is blocked by antagonists. 

G. Extracellular Acidification Rate 

20 In yet another assay, the effects of candidate modulators of NgR activity are 

assayed by monitoring extracellular changes in pH induced by the test compounds (see, 
e.g., Dunlop et al (1998) Pharmacol Toxicol Metk 40, 47-55). In one 
embodiment, CHO cells transfected with a NgR expression vector are seeded into 12 
mm capsule cups (Molecular Devices Corp.) at 4 x 10 5 cells/cup in MEM 

25 supplemented with 10% fetal bovine serum, 2 mM L-glutamine, 10 U/ml penicillin, and 
10 ixg/ml streptomycin. The cells are incubated in this medium at 37'C in 5% C0 2 for 
24 hours. 

Extracellular acidification rates are measured using a Cytosensor 
microphysiometer (Molecular Devices Corp.). The capsule cups are loaded into the 
30 sensor chambers of the microphysiometer and the chambers are perfused with running 
buffer (bicarbonate-free MEM supplemented with 4 mM L-glutamine, 10 units/ml 
penicillin, 10 ng/ml streptomycin, 26 mM NaCl) at a flow rate of 100 |xl/minute. 



10 
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Candidate agonists or other agents are diluted into the running buffer and perfused 
through a second fluid path. During each 60-second pump cycle, the pump is run for 
38 seconds and is off for the remaining 22 seconds. The pH of the running buffer in 
the sensor chamber is recorded during the cycle from 43-58 seconds, and the pump is 
5 re-started at 60 seconds to start the next cycle. The rate of acidification of the running 
buffer during the recording time is calculated by the Cytosoft program. Changes in the 
rate of acidification are calculated by subtracting the baseline value (the average of 4 
rate measurements immediately before addition of a modulator candidate) from the 
highest rate measurement obtained after addition of a modulator candidate. The 
10 selected instrument detects 61 mV/pH unit. Modulators that act as agonists of the 
receptor result in an increase in the rate of extracellular acidification compared to the 
rate in the absence of agonist. This response is blocked by modulators which act as 
antagonists of the receptor. 

15 Example 15: mNgR3 does not bind hNogo-A(1055-1120) 

To functionally test the mouse NgR3 (hereinafter, mNgR3) for its ability to 
bind hNogo-A(1055-l 120), a cDNA expression vector for a myc epitope-tagged 
mNgR3protein was created. The mouse NgR3 cDNA was amplified by PCR from 
mouse adult brain cDNA, from the signal sequence to the stop codon, and ligated into 

20 the pSecTag2 vector such that the vector encodes a signal sequence followed by a myc 
tag followed by the mature niNgR3 sequence. This plasmid was transfected into 
COS07cells, and expression of a myc-tagged protein of the predicted size was verified 
by immunoblot analysis. Alkaline phosphatase-hNogo-A(1055-l 120) binding studies 
and myc immunohistology were conducted as described (Fournier et al., supra). 

25 The cells expressing mNgR3 express the myc-tagged protein but binding to 

AP-hNogo-A(1055-1120) was not observed under the conditions employed (Fig. 8). 

Example 16: Identification of partial human NgR3 cDNA and protein sequences 

The tblastn program was used to search for the human homolog of mouse 
30 NgR3 . The mouse NgR3 protein sequence (SEQ ID NO:4) was used to query a 
proprietary human expressed sequence tag (EST) database from Incyte yielding one 
highly significant hit: Incyte Template ID 190989.1. This sequence (937 nucleotides) 
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contains an open reading frame of 3 12 amino acids in the second reverse frame that 
exhibits 88% identity with residues 66 to 381 of mouse NgR3 (SEQ ID NO:4), 
strongly indicating that it is part of the human NgR3 homolog. 

A query of SEQ ID NO:4 against the public human EST database in Genbank 
5 also produced a hit with a 465-bp EST (Accession number: R35699; Version number: 
R35699. 1; GI: 792600). There are a number of single nucleotide deletions and 
insertions within this sequence which cause frame shift errors. All of the reliable 
sequence contained in this public EST is present in the Incyte EST (Template ID 
190989.1). 

10 To obtain more nucleotide sequence that would extend the amino acid 

sequence at that carboxy terminal end, the IMA.G.E. Consortium clone No. 38319, 
which corresponds to Genbank accession No. R35699, was purchased from Incyte 
Genomics Inc. and subjected to further DNA sequence analysis. This clone consists of 
a Notl/HinD HI fragment containing the sequence of interest, cloned into the 

1 5 Notl/HinD m sites of the vector Lafinid BA 

(http://lmage.llnl.gov/image/html/libs/lafixudB A. shtml). The clone was received as an 
agar stab, which was streaked out on LB agar plates containing 50ug/ml ampicillin to 
isolate individual colonies. Six colonies were grown in LB medium with antibiotic, and 
plasmid DNA was prepared using the Promega Wizard Plus Miniprep DNA 

20 Purification System (Promega #A7500). These DNAs were subsequently digested 
with NotI and HinD HI restriction enzymes to confirm that the clones contained an 
insert. The insert of one isolate was sequenced using a combination of vector specific 
and gene specific primers yielding a partial nucleotide sequence of human NgR3 of 
1 176 nucleotides (SEQ ID NO: 13). A translation of this sequence provides a partial 

25 sequence for human NgR3 of 392 amino acids (SEQ ID NO: 14). 

The nucleotide sequence of SEQ ID NO: 13 differs from the Incyte EST 
sequence at three positions. Nucleotide positions 12-13 in SEQ ID NO: 13 are CG, 
whereas the corresponding nucleotides in the Incyte Template ID 190989. 1 are GT 
(i.e., positions 12-13 of the complement of Incyte Template ID 190989.1). In 

30 addition, position 641 in SEQ ID NO: 13 is a C, whereas the corresponding nucleotide 
in the Incyte Template ID 190989.1 sequence is an A (i.e., position 641 of the 
complement of Incyte Template ID 190989. 1). This results in two changes in amino 
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acids when comparing SEQ ID NO: 14 to the ORF encoded by Incyte Template 
190989.1: SEQ ID NO:14 contains a valine at position 5, whereas the ORF encoded 
by Incyte Template ID 190989.1 contains a leucine; SEQ ID NO:14 contains an 
alanine at position 214, whereas the ORF encoded by Incyte Template ID 190989.1 
5 contains a glutamic acid. 

The nucleotide sequence of SEQ ID NO: 13 differs from the public EST 
(Accession number: R35699; Version number: R35699. 1; GI: 792600) sequence at 
two positions (within the first 200 nucleotides of reliable sequence). Nucleotide 
positions 12-13 in SEQ ID NO: 13 are CG, whereas the corresponding nucleotides in 
10 the public EST are GT (i.e., positions 12-13 of the public EST; Accession no: R35699; 
Version no: R35699.1; GI: 792600) This leads to a single amino acid change when 
comparing SEQ ID NO: 14 to the ORF encoded by the public EST: SEQ ID NO: 14 
contains a valine at position 5, while the ORF encoded by the public EST contains a 
leucine. 

15 A Bestfit analysis of the partial human amino acid sequence with the full-length 

mouse amino acid sequence indicates that the human NgR3 amino acid sequence is 
complete at the carboxy terminal end and that they share 89.54% identity. An 
alignment of all the NgR proteins is shown in Figure 9. Although the human NgR3 
amino acid sequence is missing the first 25 amino acids, it can be determined that the 

20 human NgR3 protein contains the following features in common with the other NgR 
sequences: (1) eight Leucine Rich Repeat (LRR) domains; (2) an LRR carboxy- 
terminal (LRR-CT) domain; (3) a conserved cysteine in the fourth LRR domain; (4) a 
conserved potential glycosylation site in the eighth LRR domain; and (5) a 
hydrophobic carboxyl terminus. 

25 As those skilled in the art will appreciate, numerous changes and modifications 

may be made to the preferred embodiments of the invention without departing from the 
spirit of the invention. It is intended that all such variations fall within the scope of the 
invention. 

The entire disclosure of each publication cited herein is hereby incorporated by 
30 reference. This application claims benefit from United States provisional application 
60/238,361, filed October 6, 2000, which is incorporated by reference herein in its 
entirety. 
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Key for Sequence Listing: 




SEQIDNO:l 


human NgR2 cDNA sequence derived from genomic sequence 






AC013606 




SEQEDNO:2 


human NgR2 amino acid sequence 


5 


SEQIDNO:3 


mouse NgR3 cDNA sequence derived from AC021768 




SEQIDNO:4 


a mouse NgR3 amino acid sequence 




SEQIDNO:5 


a human NgRl amino acid sequence 




SEQK>NO:6 


a consensus amino acid sequence for NgRs 




SEQIDNO:7 


#1055-1 120 amino acid residues of hNogoA (Nogo-66) 


10 


SEQIDNO:8 


a mature human NgR2 amino acid sequence 




SEQIDNO:9 


a mature mouse NgR3 amino acid sequence 




SEQIDNO:10 


a consensus NgR LLKNT amino acid sequence 




SEQIDNO:ll 


a consensus NgR LRRCT domain amino acid sequence 




SEQIDNO:12 


a consensus NgR LRR domain amino acid sequence 


15 


SEQIDNO:13 


a partial human NgR3 nucleotide sequence 




SEQK>NO:14 


a partial human NgR3 amino acid sequence 




SEQIDNO:15 


a genomic sequence encoding a human NgR2 sequence. 




SEQEDNO:16 


a genomic sequence (complementary strand) encoding a mouse 






NgR3 


20 


SEQIDNO:17 


a mouse NgRl amino acid sequence 




SEQIDNO:18 


a consensus sequence for the NTLRRCT domain of NgR 




SEQIDNO:19 


an consensus NgR LRRCT domain amino acid sequence 



25 



30 
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What is claimed is: 

5 

1 . An isolated nucleic acid comprising a nucleotide sequence encoding 
a polypeptide comprising an LRRCT domain consisting of the amino acid sequence: 



10 



NX 1 WX 2 CX 3 CRARX 4 LWX 5 WX 6 X 7 X 8 X 9 RX 10 SSSX n V 
X 12 C X 13 X 14 P X 15 X 16 X 17 X 18 X 19 X2 0 D L X^ X 22 L X^ X 24 X 25 D 
X^X^X^CtSEQIDNO: 19] 



15 wherein X is any amino acid or a gap and the polypeptide does not comprise 

the amino acid sequence from residue 260 to 309 of SEQ ID NO: 5 (human NgRl) or 
SEQ ID NO: 17 (mouse NgRl). 

2. The isolated nucleic acid according to claim 1, wherein X 17 and X^ 
20 are independently selected from the group consisting of: arginine and lysine. 

3. The isolated nucleic acid according to claim 2, wherein the amino 
acid sequence of the LRRCT domain is selected from the group consisting of: residues 
#261-3 10 of SEQ ID NO:2 and residues 261-3 10 of SEQ ID NO: 2 with up to 10 

25 conservative amino acid substitutions. 

4. An isolated nucleic acid encoding the polypeptide of SEQ ID NO: 2. 

j 

5. An isolated nucleic acid encoding the polypeptide of SEQ ID NO: 4 
30 (mouse NgR3) or SEQ ID NO: 14 (human NgR3). 



6. The isolated nucleic acid according to claim 1, wherein the 
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polypeptide comprises: (a) a NTLRRCT domain, and (b) less than a complete CTS 
domain, provided that a partial CTS domain, if present, consists of no more than the 
first 39 amino acids of the CTS domain. 

5 7. The isolated nucleic acid to claim 1, wherein the polypeptide does 

not comprise an intact GPI domain. 

8. An isolated nucleic acid consisting essentially of a nucleotide 
sequence complementary to a nucleotide sequence encoding a polypeptide selected 
10 from the group consisting of: a polypeptide consisting of residues 3 1 1-395 of SEQ ID 
NO: 2, a polypeptide consisting of residues 256-396 of SEQ ID NO: 14 and a 
polypeptide consisting of residues 321-438 of SEQ ID NO: 4, wherein the nucleic acid 
is from 8 to 100 nucleotides in length. 

15 9. A vector comprising the nucleic acid of any one of claims 1, 4 or 5. 

10. A host cell comprising a vector according to claim 9. 

11. A polypeptide comprising a LRRCT amino acid sequence: 

20 

NXjW^CXjCRMJQLW^WXfi^X^RX^SSSXnV 

X J2 C X 13 X 14 P X 15 X 16 X 17 X 18 X 19 X 20 D L X^ X^ L X23 X 24 X25 D 

25 X 26 X^ X 28 C [SEQ ID NO: 19] 

wherein X is any amino acid residue or a gap and the polypeptide does not 
comprise the amino acid sequence from residue 260 to 309 of SEQ ID NO: 5 
(human NgRl) or SEQ ID NO: 17 (mouse NgRl). 

30 

12. The polypeptide according to claim 1 1, wherein X n and X B is 
selected from the group consisting of arginine and lysine. 
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13. The polypeptide according to claim 1 1, wherein X 19 is glycine. 
[SEQIDNO:ll] 

14. The polypeptide according to claim 1 1, wherein the amino acid 

5 sequence is selected from the group consisting of residues 261-3 10 of SEQ ID NO: 2, 
residues 206-255 of SEQ ID NO: 14, residues 271-320 of SEQ ID NO:4 and amino 
acid sequences thereof comprising a conservative substitution. 



10 



15. A polypeptide comprising a NTLRRCT amino acid sequence: 



C P X! X 2 C X 3 C Y X 4 P X 6 X 7 T X 8 S C X9 X 10 X n X 12 X 13 X 14 X 15 X 16 P 
X 17 X lg X 19 P Xjo X 21 X22 X23 R X 24 F L X25 X^ N X27 1 X 28 X29 X 30 X 31 X 32 X 33 
X 34 F X 35 X 36 X 37 X 38 X 39 X^ X 41 X 42 L W X 43 X 44 S N X 45 X 46 X 47 X 48 1 X 49 
X 50 X 52 F Xjg X54 X 55 X 56 Xy, L E Xjg LDLXj,DN X 6l L X^ X^, 

15 X^ P Xgg T F X <7 G L Xgg Xj 9 L X 70 X 7l L X^ L X 74 C X 75 L X 76 X^ L X 78 

X 79 X 80 Xgi F X K GL X 83 X 84 L Q Y L Y L Q X 85 N X 86 X 87 X 88 X 89 LX^D 
X^ X^ F X 93 D L X 94 N L X^ H L F L H G N X^ X^ X 98 X$ 9 X 10o X 101 X 102 
X103 X 104 F R G L X 105 X 106 L D R L L L H X 107 N X 108 X 109 X 110 X ni V H X 112 
X 113 A F X 114 X ll5 L X 116 R L X U7 X 118 L X 119 L F X 120 N X 121 L X 122 X 123 L 

20 X 124 X 125 X 126 X 127 L X 128 X 129 L X 130 X 131 L X 132 X 133 L R L N X 134 N X 135 W 

X 136 C X 137 C R X 138 R X 139 L W X 140 W X 141 X 142 X 143 X 144 R X 145 S S S X 146 
V X 147 C X 148 X 149 P X 150 X 151 X 152 X 153 X 154 X 155 D L X 156 X 157 L X 158 X 159 X 160 
DX m X 162 X 163 C[SEQIDNO:18] 

25 wherein X is any amino acid residue or a gap and wherein the polypeptide is 

not the polypeptide of SEQ ID NO: 5 (human NgRl) or SEQ ID NO: 17 
(mouse NgRl). 

16. The polypeptide according to claim 15, wherein Xg, X 37 and X 38 
30 represents a gap. 



17. A polypeptide comprising an amino sequence selected from the 
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group consisting of: SEQ ID NO:2, SEQ ID NO:4 and SEQ ID NO: 14. 

18. The polypeptide according any one of claims 11, 15 or 17, wherein 
the polypeptide comprises: (a) an NTLRRCT domain, and (b) less than a complete 

5 CTS domain, provided that a partial CTS domain, if present, consists of no more than 
the first 39 amino acids of the CTS domain. 

19. The polypeptide according to any one of claims 11, 15 or 17, 
wherein the polypeptide does not comprise an intact GPI domain. 

10 

20. The polypeptide according to any one of claims 11, 15 or 17, 
wherein the amino acid sequence of the polypeptide further comprises an amino acid 
sequence of a heterologous polypeptide. 

15 21. The polypeptide according to claim 20, wherein the heterologous 

polypeptide is an Fc portion of an antibody. 

22. A method of producing a polypeptide according to any one of 
claims 1 1, 15 or 17, comprising the steps of introducing an isolated nucleic acid 

20 according to any one of claims 1, 4, 5 or 8 or a vector according to claim 9 into a host 
cell, culturing said host cell under conditions suitable for expression of said 
polypeptide, and recovering said polypeptide. 

23. An antibody that binds to a polypeptide of any one of claims 11,15 

25 or 17. 

24. A composition comprising the polypeptide of claim 11, 15 or 17 
and a pharmaceutical^ acceptable carrier. 



30 



25. A composition comprising the antibody of claim 23 and a 
pharmaceutically acceptable carrier. 
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26. A method of decreasing inhibition of axonal growth of a CNS 
neuron, comprising the step of contacting the neuron with an effective amount of the 
polypeptide of claim 11, 15 or 17. 

5 27. A method of treating a central nervous system disease, disorder or 

injury, comprising administering to a mammal an effective amount of the polypeptide 
of claim 11, 15 or 17. 

28. A method of decreasing inhibition of axonal growth of a CNS 
10 neuron comprising the step of contacting the neuron with an effective amount of the 
antibody according to claim 23. 



29. A method of treating a central nervous system disease, disorder or 
injury, comprising administering to a mammal an effective amount of the antibody 

1 5 according to claim 23 . 

30. A method for identifying a molecule that binds a polypeptide of 
claim 11, 15 or 17 comprising the steps of: 

(a) providing a polypeptide of claim 11, 15 or 17; 
20 (b) contacting the polypeptide with the candidate molecule; 

and 

(c) detecting binding of the candidate molecule to the 
polypeptide. 
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SEQUENCE LISTING 



<110> BIOGEN, INC. 

YALE UNIVERSITY 
STRITTMATTER, STEPHEN M. 
CATE/ RICHARD L. 
SAH, DINAH W.Y. 

<120> NOGO RECEPTOR HOMOLOGS 



<130> A116PCT 



<140> 
<141> 



<150> 60/238,361 
<151> 2000-10-06 



<160> 16 



<170> Patentln Ver. 2.1 



<210> 17 
<211> 1260 
<212> DNA 

<213> Homo sapiens 
<400> 1 

atgctgcccg ggctcaggcg cctgctgcaa 
ctcctggccc tgcccctggc ggcccccagc 
ccgcccaccg tgagctgcca ggccaacaac 
agcactcagc gactcttcct gcagaacaac 
gggtccaacc tgctcaccct gtggctcttc 
actttccgcc acttgcaagc cctggaggag 
tcgctggagc ccgacacctt ccagggcctg 
tgccagctca gcagcctgcc cggcaacatc 
tacctccagg agaacagcct gctccaccta 
ctgagccacc tcttcctcca cgggaaccgc 
ggcctgggca gcctggaccg gctgctgctg 
gcggccttcc gcggcctcag ccgcctcacc 
tcgctgcccg gcgaggcgct cgccgacctg 
aacccctggg cgtgcgactg ccgcgcgcgg 
gtgtccagct ccgacgtgac ctgcgccacc 
gcgctccgcg aggccgactt ccaggcgtgt 
cgcgcccgcg gcaacagctc ctccaaccac 
ccagccgatc cctccaccct ctaccgagat 
ggcggggacg cgcctactga ggacgactac 
ggggagcaga tgtgccccgg cgctgcctgc 
ctctcggccg ggctccccag ccctctgctt 



gctcccgcct cggcctgcct cctgctgatg 60 
tgccccatgc tctgcacctg ctactcatcc 120 
ttctcctctg tgccgctgtc cctgccaccc 180 
ctcatccgca cgctgcggcc aggcaccttt 240 
tccaacaacc tctccaccat ctacccgggc 300 
ctggacctcg gtgacaaccg gcacctgcgc 360 
gagcggctgc agtcgctgca tttgtaccgc 420 
ttccgaggcc tggtcagcct gcagtacctc 480 
caggatgact tgttcgcgga cctggccaac 540 
ctgcggctgc tcacagagca cgtgtttcgc 600 
cacgggaacc ggctgcaggg cgtgcaccgc 660 
atcctctacc tgttcaacaa cagcctggcc 720 
ccctcgctcg agttcctgcg gctcaacgct 780 
ccgctctggg cctggttcca gcgcgcgcgc 840 
cccccggagc gccagggccg agacctgcgc 900 
ccgcccgcgg cacccacgcg gccgggcagc 960 
ctgtacgggg tggccgaggc cggggcgccc 1020 
ctgcctgccg aagactcgcg ggggcgccag 1080 
tgggggggct acgggggtga ggaccagcga 1140 
caggcgcccc cggactcccg aggccctgcg 1200 
tgcctcctgc tcctggtgcc ccaccacctc 1260 



<210> 2 
<211> 420 
<212> PRT 

<213> Homo sapiens 

<400> 2 , ^ 

Met Leu Pro Gly Leu Arg Arg Leu Leu Gin Ala Pro Ala Ser Ala Cys 
I 5 10 15 

Leu Leu Leu Met Leu Leu Ala Leu Pro Leu Ala Ala Pro Ser Cys Pro 
20 25 30 
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Met Leu Cys Thr Cys Tyr -Ser Ser Pro Pro Thr Val Ser Cys Gin Ala 
35 40 45 

Asn Asn Phe Ser Ser Val Pro Leu Ser Leu Pro Pro Ser Thr Gin Arg 
50 55 60 

Leu Phe Leu Gin Asn Asn Leu lie Arg Thr Leu Arg Pro Gly Thr Phe 
65 70 75 80 

Gly Ser Asn Leu Leu Thr Leu Trp Leu Phe Ser Asn Asn Leu Ser Thr 
85 90 95 

He Tyr Pro Gly Thr Phe Arg His Leu Gin Ala Leu Glu Glu Leu Asp 
100 105 HO 

Leu Gly Asp Asn Arg His Leu Arg Ser Leu Glu Pro Asp Thr Phe Gin 
115 120 125 

Gly Leu Glu Arg Leu Gin Ser Leu His Leu Tyr Arg Cys Gin Leu Ser 
130 " 135 140 

Ser Leu Pro Gly Asn He Phe Arg Gly Leu Val Ser Leu Gin Tyr Leu 
145 * 150 155 160 

Tyr Leu Gin Glu Asn Ser Leu Leu His Leu Gin Asp Asp Leu Phe Ala 
165 170 175 

Asp Leu Ala Asn Leu Ser His Leu Phe Leu His Gly Asn Arg Leu Arg 
180 185 190 

Leu Leu Thr Glu His Val Phe Arg Gly Leu Gly Ser Leu Asp Arg Leu 
195 200 205 

Leu Leu His Gly Asn Arg Leu Gin Gly Val His Arg Ala Ala Phe Arg 
210 215 220 

Gly Leu Ser Arg Leu Thr He Leu Tyr Leu Phe Asn Asn Ser Leu Ala 
225 230 235 240 

Ser Leu Pro Gly Glu Ala Leu Ala Asp Leu Pro Ser Leu Glu Phe Leu 
245 250 255 

Arg Leu Asn Ala Asn Pro Trp Ala Cys Asp Cys Arg Ala Arg Pro Leu 
260 265 270 

Trp Ala Trp Phe Gin Arg Ala Arg Val Ser Ser Ser Asp Val Thr Cys 
275 280 285 

Ala Thr Pro Pro Glu Arg Gin Gly Arg Asp Leu Arg Ala Leu Arg Glu 
290 295 300 

Ala Asp Phe Gin Ala Cys Pro Pro Ala Ala Pro Thr Arg Pro Gly Ser 
305 310 315 320 

Arg Ala Arg Gly Asn Ser Ser Ser Asn His Leu Tyr Gly Val Ala Glu 
325 330 335 

Ala Gly Ala Pro Pro Ala Asp Pro Ser Thr Leu Tyr Arg Asp Leu Pro 
340 345 350 

Ala Glu Asp Ser Arg Gly Arg Gin Gly Gly Asp Ala Pro Thr Glu Asp 
355 360 365 

Asp Tyr Trp Gly Gly Tyr Gly Gly Glu Asp Gin Arg Gly Glu Gin Met 
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370 



375 



380 



Cys Pro Gly Ala Ala Cys Gin Ala Pro Pro Asp Ser Arg Gly Pro Ala 
385 390 395 400 

Leu Ser Ala Gly Leu Pro Ser Pro Leu Leu Cys Leu Leu Leu Leu Val 
405 410 415 

Pro His His Leu 
420 



<210> 3 
<211> 1383 
<212> DNA 
<213> Mus sp. 



<400> 3 

atgtcttggc 

gggtgctgtg 

tgtcctcgag 

tttgctgcca 

cgcatcacct 

tccaacaaca 

ctagaccttg 

gtgaagcttc 

tttggtggcc 

caagatgaca 

ctatggagcc 

catgagaacc 

accctctttc 

gtggccttgg 

tccctgtggg 

cccgagctgc 

acaggaccag 

gcccgcaagg 

catccgcctg 

aaccggaacc 

gcccccgact 

aagggcaagt 

tcaggcacgg 

cgc 



agtctggaac 
tggaattgct 
actgtgtgtg 
tcccggaggg 
tcctccagca 
tcactttcat 
gagacaaccg 
acgccctcta 
tgcacagcct 
tctttgtgga 
tgggccaagg 
agctacagtg 
tcttcaacaa 
agttccttcg 
aatggctgcg 
ggcaaggcca 
tgtctcctca 
agcaccatcc 
gctccaggtc 
agatctctaa 
atcagcacaa 
gtgctcgcag 
cccttggggc 



cacagtgaca 
gctgttgctg 
ctaccctgcg 
catcccagag 
gggccacttc 
tgctcccaac 
acagctgcga 
cctctataag 
gcagtatctc 
cctggtcaat 
catcttccgg 
ggttcaccac 
cagcctcact 
cctcaatggg 
aaggttccgt 
ggatctgaag 
ccagatcaag 
gtcccatggg 
aggttacaag 
ggtgagctct 
gttcagcttt 
gacccccatc 
cccactcctg 



caatctcccg 
ctcgctggag 
cccatgactg 
gacagtgagc 
agccccgcca 
accttcgagg 
acgctggcac 
tgtggactga 
tacttgcagg 
ctcagtcact 
ggcctggtga 
aaggctttcc 
gagctgcagg 
aatgcttggg 
ggctctagct 
ctgctgaggg 
tctcacacgc 
gcctccaggg 
aaggcaggca 
gggaaagagc 
gacatcatgc 
cgtgccccca 
gcctggatac 



tgcaggctgc 
agctacctct 
tcagctgcca 
gcatcttcct 
tggtcaccct 
gctttgtgca 
ccgagacctt 
gcgccctgcc 
acaaccatat 
tgtttctcca 
acctggaccg 
atgacctcca 
gtgactgtct 
actgtggctg 
ctgctgtccc 
tggaggactt 
ttaccacctc 
acaaaggcca 
agaactgcac 
ttaccgaact 
ccaccgcacg 
gtggggtgca 
tggggctggc 



tcaggtctca 
gggtggtggt 
ggcacacaac 
gcagaacaat 
ctggatctac 
tctggaggag 
ccaaggcctg 
cgcaggcatc 
cgagtacctc 
tggtaacaag 
gttgctgctg 
caggctaacc 
ggcccccctg 
ccgggcacgt 
ctgcgcgacc 
ccggaactgc 
tgacagggct 
cccacatggc 
cagccacagg 
gcaggactat 
acccaagagg 
gcaggcatcc 
agtcactctc 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1383 



<210> 4 
<211> 461 
<212> PRT 
<213> Mus sp. 

<400> 4 

Met Ser Trp Gin Ser Gly Thr Thr Val Thr Gin Ser Pro Val Gin Ala 
1 ! 5 10 15 

Ala Gin Val Ser Gly Cys Cys Val Glu Leu Leu Leu Leu Leu Leu Ala 
20 " ~ 25 30 

Gly Glu Leu Pro Leu Gly Gly Gly Cys Pro Arg Asp Cys Val Cys Tyr 
35 40 45 

Pro Ala Pro Met Thr Val Ser Cys Gin Ala His Asn Phe Ala Ala lie 
50 55 60 

Pro Glu Gly lie Pro Glu Asp Ser Glu Arg lie Phe Leu Gin Asn Asn 
65 ^70 75 B0 
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Arg lie Thr Phe Leu Gin Gin Gly His Phe Ser Pro Ala Met Val Thr 
85 90 95 

Leu Trp lie Tyr Ser Asn Asn He Thr Phe He Ala Pro Asn Thr Phe 
100 105 HO 

Glu Gly Phe Val His Leu Glu Glu Leu Asp Leu Gly Asp Asn Arg Gin 
115 120 125 

Leu Arg Thr Leu Ala Pro Glu Thr Phe Gin Gly Leu Val Lys Leu His 
130 135 140 

Ala Leu Tyr Leu Tyr Lys Cys Gly Leu Ser Ala Leu Pro Ala Gly He 
145 150 " 155 160 

Phe Gly Gly Leu His Ser Leu Gin Tyr Leu Tyr Leu Gin Asp Asn His 
165 170 175 

He Glu Tyr Leu Gin Asp Asp He Phe Val Asp Leu Val Asn Leu Ser 
180 185 190 

His Leu Phe Leu His Gly Asn Lys Leu Trp Ser Leu Gly Gin Gly He 
195 200 205 

Phe Arg Gly Leu Val Asn Leu Asp Arg Leu Leu Leu His Glu Asn Gin 
210 215 220 

Leu Gin Trp Val His His Lys Ala Phe His Asp Leu His Arg Leu Thr 
225 230 235 240 

Thr Leu Phe Leu Phe Asn Asn Ser Leu Thr Glu Leu Gin Gly Asp Cys 
245 250 255 

Leu Ala Pro Leu Val Ala Leu Glu Phe Leu Arg Leu Asn Gly Asn Ala 
260 265 270 

Trp Asp Cys Gly Cys Arg Ala Arg Ser Leu Trp Glu Trp Leu Arg Arg 
275 280 285 

Phe Arg Gly Ser Ser Ser Ala Val Pro Cys Ala Thr Pro Glu Leu Arg 
290 295 300 

Gin Gly Gin Asp Leu Lys Leu Leu Arg Val Glu Asp Phe Arg Asn Cys 
305 * 310 315 320 

Thr Gly Pro Val Ser Pro His Gin He Lys Ser His Thr Leu Thr Thr 
325 330 335 

Ser Asp Arg Ala Ala Arg Lys Glu His His Pro Ser His Gly Ala Ser 
340 345 350 

Arg Asp Lys Gly His Pro His Gly His Pro Pro Gly Ser Arg Ser Gly 
355 " 360 365 

Tyr Lys Lys Ala Gly Lys Asn Cys Thr Ser His Arg Asn Arg Asn Gin 
370 " 375 380 

He Ser Lys Val Ser Ser Gly Lys Glu Leu Thr Glu Leu Gin Asp Tyr 
385 ' 390 395 400 

Ala Pro Asp Tyr Gin His Lys Phe Ser Phe Asp He Met Pro Thr Ala 
405 410 415 

Arg Pro Lys Arg Lys Gly Lys Cys Ala Arg Arg Thr Pro He Arg Ala 
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420 425 430 

Pro Ser Gly Val Gin Gin Ala Ser Ser Gly Thr Ala Leu Gly Ala Pro 
435 440 445 

Leu Leu Ala Trp lie Leu Gly Leu Ala Val Thr Leu Arg 
450 455 460 



<210> 5 

<211> 473 

<212> PRT 

<213> Homo sapiens 

<400> 5 

Met Lys Arg Ala Ser Ala Gly Gly Ser Arg Leu Leu Ala Trp Val Leu 
1 5 10 15 

Trp Leu Gin Ala Trp Gin Val Ala Ala Pro Cys Pro Gly Ala Cys Val 
20 25 30 

Cys Tyr Asn Glu Pro Lys Val Thr Thr Ser Cys Pro Gin Gin Gly Leu 
35 40 45 

Gin Ala Val Pro Val Gly He Pro Ala Ala Ser Gin Arg He Phe Leu 
50 55 60 

His Gly Asn Arg He Ser His Val Pro Ala Ala Ser Phe Arg Ala Cys 
65 70 75 80 

Arg Asn Leu Thr He Leu Trp Leu His Ser Asn Val Leu Ala Arg He 
85 90 95 

Asp Ala Ala Ala Phe Thr Gly Leu Ala Leu Leu Glu Gin Leu Asp Leu 
100 105 HO 

Ser Asp Asn Ala Gin Leu Arg Ser Val Asp Pro Ala Thr Phe His Gly 
115 120 125 

Leu Gly Arg Leu His Thr Leu His Leu Asp Arg Cys Gly Leu Gin Glu 
130 . 135 140 

Leu Gly Pro Gly Leu Phe Arg Gly Leu Ala Ala Leu Gin Tyr Leu Tyr 
145 150 155 160 

Leu Gin Asp Asn Ala Leu Gin Ala Leu Pro Asp Asp Thr Phe Arg Asp 
165 170 175 

Leu Gly Asn Leu Thr His Leu Phe Leu His Gly Asn Arg He Ser Ser 
180 185 190 

Val Pro Glu Arg Ala Phe Arg Gly Leu His Ser Leu Asp Arg Leu Leu 
195 200 205 

Leu His Gin Asn Arg Val Ala His Val His Pro His Ala Phe Arg Asp 
210 215 220 

Leu Gly Arg Leu Met Thr Leu Tyr Leu Phe Ala Asn Asn Leu Ser Ala 
225 230 235 240 

Leu Pro Thr Glu Ala Leu Ala Pro Leu Arg Ala Leu Gin Tyr Leu Arg 
245 250 255 

Leu Asn Asp Asn Pro Trp Val Cys Asp Cys Arg Ala Arg Pro Leu Trp 
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260 265 270 

Ala Trp Leu Gin Lys Phe Arg Gly Ser Ser Ser Glu Val Pro Cys Ser 
275 280 285 

Leu Pro Gin Arg Leu Ala Gly Arg Asp Leu Lys Arg Leu Ala Ala Asn 
290 295 300 

Asp Leu Gin Gly Cys Ala Val Ala Thr Gly Pro Tyr His Pro lie Trp 
305 310 315 320 

Thr Gly Arg Ala Thr Asp Glu Glu Pro Leu Gly Leu Pro Lys Cys Cys 
325 330 335 

Gin Pro Asp Ala Ala Asp Lys Ala Ser Val Leu Glu Pro Gly Arg Pro 
340 345 350 

Ala Ser Ala Gly Asn Ala Leu Lys Gly Arg Val Pro Pro Gly Asp Ser 
355 360 365 

Pro Pro Gly Asn Gly Ser Gly Pro Arg His lie Asn Asp Ser Pro Phe 
370 ' 375 380 

' Gly Thr Leu Pro Gly Ser Ala Glu Pro Pro Leu Thr Ala Val Arg Pro 
385 390 395 400 

Glu Gly Ser Glu Pro Pro Gly Phe Pro Thr Ser Gly Pro Arg Arg Arg 
405 410 * 415 

Pro Gly Cys Ser Arg Lys Asn Arg Thr Arg Ser His Cys Arg Leu Gly 
420 425 430 

Gin Ala Gly Ser Gly Gly Gly Gly Thr Gly Asp Ser Glu Gly Ser Gly 
435 440 445 

Ala Leu Pro Ser Leu Thr Cys Ser Leu Thr Pro Leu Gly Leu Ala Leu 
450 455 460 

Val Leu Trp Thr Val Leu Gly Pro Cys 
465 * 470 



<210> 6 
<211> 440 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Consensus 
sequence 

<220> 

<221> MOD_RES 

<222> (3) . . (4) 

<223> Variable amino acid 

<220> 

<221> MOD_RES 
<222> (6) 

<223> Variable amino acid 
<220> 

<221> MODJRES 
<222> (9) . . (10) 
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<223> Variable amino acid 
<220> 

<221> MODJRES 

<222> (12) . . (13) 

<223> Variable amino acid 

<220> 

<221> MOD_RES 
<222> (15) 

<223> Variable amino acid 
<220> 

<221> MOD_RES 

<222> (18) . . (25) 

<223> Variable amino acid 

<220> 

<221> MOD__RES 

<222> (27) . . (29) 

<223> Variable amino acid 

<220> 

<221> MOD_RES 

<222> (31) . . (34) 

<223> Variable amino acid 

<220> 

<221> MOD_RES 
<222> (36) 

<223> Variable amino acid 
<220> 

<221> MOD_RES 

<222> (39) . . (40) 

<223> Variable amino acid 

<220> 

<221> MOD_RES 
<222> (42) 

<223> Variable amino acid 
<220> 

<221> MODJRES 

<222> (44) . . (50) 

<223> Variable amino acid 

<220> 

<221> MOD_RES 

<222> (52) . . (59) 

<223> Variable amino acid 

<220> 

<221> MOD_RES 

<222> (62) . . (63) 

<223> Variable amino acid 

<220> 

<221> MOD_RES 

<222> (66) . . (69) 

<223> Variable amino acid 

<220> 

<221> MOD RES 
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<222> (71) . . (74) 

<223> Variable amino acid 

<220> 

<221> MOD_RES 

<222> (76) . . (80) 

<223> Variable amino acid 

<220> 

<221> MOD_RES 
<222> (83) 

<223> Variable amino acid 
<220> 

<221> MOD_RES 
<222> (87) 

<223> Variable amino acid 
<220> 

<221> MOD_RES 

<222> (90) . . (91) 

<223> Variable amino acid 

<220> 

<221> MOD_RES 

<222> (94) . . (96) 

<223> Variable amino acid 

<220> 

<221> MOD_RES 

<222> (98) 

<223> Variable amino acid 
<220> 

<221> MOD_RES 

<222> (101) 

<223> Variable amino acid 
<220> 

<221> MOD_RES 

<222> (104).. (105) 

<223> Variable amino acid 

<220> 

<221> MOD_RES 
<222> (107) 

<223> Variable amino acid 
<220> 

<221> MOD_RES 
<222> (109) 

<223> Variable amino acid 
<220> 

<221> MOD_RES 

<222> (111).. (112) 

<223> Variable amino acid 

<220> 

<221> MOD_RES 
<222> (114) 

<223> Variable amino acid 
<220> 
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<221> MOD_RES 

<222> (116) . . (117) 

<223> Variable amino acid 

<220> 

<221> MOD_RES 

<222> (119).. (122) 

<223> Variable amino acid 

<220> 

<221> MOD_RES 
<222> (124) 

<223> Variable amino acid 
<220> 

<221> MOD_RES 

<222> (127) . . (128) 

<223> Variable amino acid 

<220> 

<221> MOD_RES 
<222> (136) 

<223> Variable amino acid 
<220> 

<221> MOD_RES 

<222> (138) . . (141) 

<223> Variable amino acid 

<220> 

<221> MOD_RES 
<222> (143) 

<223> Variable amino acid 
<220> 

<221> MOD_RES 
<222> (146) 

<223> Variable amino acid 
<220> 

<221> MOD_RES 
<222> (148) 

<223> Variable amino acid 
<220> 

<221> MOD_RES 
<222> (151) 

<223> Variable amino acid 
<220> 

<221> MOD_RES 
<222> (154) 

<223> Variable amino acid 
<220> 

<221> MOD_RES 

<222> (162) . . (170) 

<223> Variable amino acid 

<220> 

<221> MODJRES 

<222> (175) . . (176) 

<223> Variable amino acid 
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<220> 

<221> M0D_RES 
<222> (184) 

<223> Variable amino acid 
<220> 

<221> MOD_RE5 

<222> (186) . . (189) 

<223> Variable amino acid 

<220> 

<221> MOD_RES 

<222> (192) . . (193) 

<223> Variable amino acid 

<220> 

<221> MOD_RES 

<222> (196) . . (197) 

<223> Variable amino acid 

<220> 

<221> MOD_RES 
<222> (199) 

<223> Variable amino acid 
<220> 

<221> MOD_RES 

<222> (202) . . (203) 

<223> Variable amino acid 

<220> 

<221> MOD_RES 
<222> (205) 

<223> Variable amino acid 
<220> 

<221> MOD_RES 
<222> (208) 

<223> Variable amino acid 
<220> 

<221> MOD_RES 
<222> (210) 

<223> Variable amino acid 
<220> 

<221> MOD_RES 

<222> (212) . . (213) 

<223> Variable amino acid 

<220> 

<221> MOD_RES 

<222> (215) . . (218) 

<223> Variable amino acid 

<220> 

<221> MOD_RES 
<222> (221) 

<223> Variable amino acid 
<220> 

<221> MOD_RES 

<222> (223) . . (224) 

<223> Variable amino acid 
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<220> 

<221> MOD_RES 

<222> (226) . . (227) 

<223> Variable amino acid 

<220> 

<221> MOD_RES 
<222> (232) 

<223> Variable amino acid 
<220> 

<221> MOD_RES 
<222> (234) 

<223> Variable amino acid 
<220> 

<221> MOD_RES 
<222> (236) 

<223> Variable amino acid 



<220> 

<221> MOD_RES 
<222> (238) 
<223> Variable 



amino acid 



<220> 

<221> MOD_RES 
<222> (243) 

<223> Variable amino acid 
<220> 

<221> MOD_RES 
<222> (246) 

<223> Variable amino acid 
<220> 

<221> MOD_RES 

<222> (248) . . (251) 

<223> Variable amino acid 

<220> 

<221> MODJRES 
<222> (253) 

<223> Variable amino acid 
<220> 

<221> MOD_RES 
<222> (257) 

<223> Variable amino acid 
<220> 

<221> MOD_RES 
<222> (259) 

<223> Variable amino acid 
<220> 

<221> MOD_RES 

<222> (261) . . (262) 

<223> Variable amino acid 

<220> 

<221> MOD_RES 

<222> (264) . . (267) 

<223> Variable amino acid 
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<220> 

<221> MOD_RES 
<222> (269) 

<223> Variable amino acid 
<220> 

<221> MOD_RES 

<222> (272) . . (273) 

<223> Variable amino acid 

<220> 

<221> MOD_RES 

<222> (275) . . (277) 

<223> Variable amino acid 

<220> 

<221> MOD_RES 

<222> (279) . . (281) 

<223> Variable amino acid 

<220> . 

<221> MOD_RES 

<222> (283) . . (287) 

<223> Variable amino acid 

<220> 

<221> MOD_RES 

<222> (289) . . (290) 

<223> Variable amino acid 

<220> 

<221> MOD_RES 

<222> (292) . . (328) 

<223> Variable amino acid 

<220> 

<221> MOD_RES 

<222> (330) . . (341) 

<223> Variable amino acid 

<220> 

<221> MOD_RES 

<222> (344) . . (346) 

<223> Variable amino acid 

<220> 

<221> MOD_RES 

<222> (348) . . (399) 

<223> Variable amino acid 

<220> 

<221> MOD_RES 

<222> (401) . . (428) 

<223> Variable amino acid 

<220> 

<221> MOD_RES 

<222> (431) . . (439) 

<223> Variable amino acid 

<400> 6 

Cys Pro Xaa Xaa Cys Xaa Cys Tyr Xaa Xaa Pro Xaa Xaa Thr Xaa Ser 
1 5 " 10 15 
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Cys Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Pro Xaa Xaa Xaa Pro Xaa Xaa 
20 25 30 

Xaa Xaa Arg Xaa Phe Leu Xaa Xaa Asn Xaa He Xaa Xaa Xaa Xaa Xaa 
35 40 45 

Xaa Xaa Phe Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Leu Trp Xaa Xaa Ser 



Leu Glu Xaa Leu Asp Leu Xaa Asp Asn Xaa Xaa Leu Arg Xaa Xaa Xaa 
85 90 95 

Pro Xaa Thr Phe Xaa Gly Leu Xaa Xaa Leu Xaa Leu Xaa Leu Xaa Xaa 
100 105 HO 

Cys Xaa Leu Xaa Xaa Leu Xaa Xaa Xaa Xaa Phe Xaa Gly Leu Xaa Xaa 
115 120 125 

Leu Gin Tyr Leu Tyr Leu Gin Xaa Asn Xaa Xaa Xaa Xaa Leu Xaa Asp 
130 135 140 

Asp Xaa Phe Xaa Asp Leu Xaa Asn Leu Xaa His Leu Phe Leu His Gly 
145 150 155 160 

Asn Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Phe" Arg Gly Leu Xaa Xaa 
165 170 175 

Leu Asp Arg Leu Leu Leu His Xaa Asn Xaa Xaa Xaa Xaa Val His Xaa 
180 185 190 

Xaa Ala Phe Xaa Xaa Leu Xaa Arg Leu Xaa Xaa Leu Xaa Leu Phe Xaa 
195 200 205 

Asn Xaa Leu Xaa Xaa Leu Xaa Xaa Xaa Xaa Leu Ala Xaa Leu Xaa Xaa 
210 215 220 

Leu Xaa Xaa Leu Arg Leu Asn Xaa Asn Xaa Trp Xaa Cys Xaa Cys Arg 
225 230 235 240 

Ala Arg Xaa Leu Trp Xaa Trp Xaa Xaa Xaa Xaa Arg Xaa Ser Ser Ser 
245 250 255 

Xaa Val Xaa Cys Xaa Xaa Pro Xaa Xaa Xaa Xaa Gly Xaa Asp Leu Xaa 
260 265 270 

Xaa Leu Xaa Xaa Xaa Asp Xaa Xaa Xaa Cys Xaa Xaa Xaa Xaa Xaa Pro 
275 280 285 

Xaa Xaa Pro Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
290 295 300 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
305 310 315 320 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Gly Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
325 330 335 

Xaa Xaa Xaa Xaa Xaa Pro Pro Xaa Xaa Xaa Ser Xaa Xaa Xaa Xaa Xaa 
340 345 350 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 



50 



55 



60 



Asn Xaa Xaa Xaa 
65 



Xaa He Xaa Xaa Xaa Xaa Phe Xaa Xaa Xaa Xaa Xaa 
70 75 80 
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355 360 365 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
370 375 380 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Arg 
385 390 395 400 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
405 410 415 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Leu Xaa Xaa Xaa 
420 425 430 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Leu 
435 440 



<210> 7 

<211> 66 

<212> PRT 

<213> Homo sapiens 

<400> 7 

Arg He Tyr Lys Gly Val He Gin Ala He Gin Lys Ser Asp Glu Gly 
15 10 15 

His Pro Phe Arg Ala Tyr Leu Glu Ser Glu Val Ala He Ser Glu Glu 
20 25 30 

Leu Val Gin Lys Tyr Ser Asn Ser Ala Leu Gly His Val Asn Cys Thr 
35 4 40 45 

He Lys Glu Leu Arg Arg Leu Phe Leu Val Asp Asp Leu Val Asp Ser 
50 55 60 

Leu Lys 
65 



<210> 8 

<211> 390 

<212> PRT 

<213> Homo sapeins 

<400> 8 

Cys Pro Met Leu Cys Thr Cys Tyr Ser Ser Pro Pro Thr Val Ser Cys 
15 10 15 

Gin Ala Asn Asn Phe Ser Ser Val Pro Leu Ser Leu Pro Pro Ser Thr 
20 25 30 

Gin Arg Leu Phe Leu Gin Asn Asn Leu He Arg Thr Leu Arg Pro Gly 
35 40 45 

Thr Phe Gly Ser Asn Leu Leu Thr Leu Trp Leu Phe Ser Asn Asn Leu 
. 50 55 60 

Ser Thr He Tyr Pro Gly Thr Phe Arg His Leu Gin Ala Leu Glu Glu 
65 70 75 80 

Leu Asp Leu Gly Asp Asn Arg His Leu Arg Ser Leu Glu Pro Asp Thr 
85 90 95 
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Phe Gin Gly Leu Glu Arg Leu Gin Ser Leu His Leu Tyr Arg Cys Gin 
100 105 HO 

Leu Ser Ser Leu Pro Gly Asn He Phe Arg Gly Leu Val Ser Leu Gin 
115 120 125 

Tyr Leu Tyr Leu Gin Glu Asn Ser Leu Leu His Leu Gin Asp Asp Leu 
130 135 140 

Phe Ala Asp Leu Ala Asn Leu Ser His Leu Phe Leu His dly Asn Arg 
145 150 155 160 

Leu Arg Leu Leu Thr Glu His Val Phe Arg Gly Leu Gly Ser Leu Asp 
165 170 175 

Arg Leu Leu Leu His Gly Asn Arg Leu Gin Gly Val His Arg Ala Ala 
180 " 185 190 

Phe Arg Gly Leu Ser Arg Leu Thr He Leu Tyr Leu Phe Asn Asn Ser 
195 200 205 

Leu Ala Ser Leu Pro Gly Glu Ala Leu Ala Asp Leu Pro Ser Leu Glu 
210 215 220 

Phe Leu Arg Leu Asn Ala Asn Pro Trp Ala Cys Asp Cys Arg Ala Arg 
225 ~ 230 235 240 

Pro Leu Trp Ala Trp Phe Gin Arg Ala Arg Val Ser Ser Ser Asp Val 
245 250 255 

Thr Cys Ala Thr Pro Pro Glu Arg Gin Gly Arg Asp Leu Arg Ala Leu 
260 265 270 

Arg Glu Ala Asp Phe Gin Ala Cys Pro Pro Ala Ala Pro Thr Arg Pro 
275 280 285 

Gly Ser Arg Ala Arg Gly Asn Ser Ser Ser Asn His Leu Tyr Gly Val 
290 " 295 300 

Ala Glu Ala Gly Ala Pro Pro Ala Asp Pro Ser Thr Leu Tyr Arg Asp 
305 * 310 315 320 

Leu Pro Ala Glu Asp Ser Arg Gly Arg Gin Gly Gly Asp Ala Pro Thr 
325 330 335 

Glu Asp Asp Tyr Trp Gly Gly Tyr Gly Gly Glu Asp Gin Arg Gly Glu 
340 345 350 

Gin Met Cys Pro Gly Ala Ala Cys Gin Ala Pro Pro Asp Ser Arg Gly 
355 360 365 

Pro Ala Leu Ser Ala Gly Leu Pro Ser Pro Leu Leu Cys Leu Leu Leu 
370 375 380 



Leu Val Pro His His Leu 
385 390 



<210> 9 
<211> 421 
<212> PRT 
<213> Mus sp. 

<400> 9 
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Cys Pro Arg Asp Cys Val Cys Tyr Pro Ala Pro Met Thr Val Ser Cys 
1 ~ 5 10 15 

Gin Ala His" Asn Phe Ala Ala He Pro Glu Gly He Pro Glu Asp Ser 
20 25 30 

Glu Arg lie Phe Leu Gin Asn Asn Arg He Thr Phe Leu Gin Gin Gly 
35 40 45 

His Phe Ser Pro Ala Met Val Thr Leu Trp He Tyr Ser Asn Asn He 
50 55 60 

Thr Phe He Ala Pro Asn Thr Phe Glu Gly Phe Val His Leu Glu Glu 
65 70 75 80 

Leu Asp Leu Gly Asp Asn Arg Gin Leu Arg Thr Leu Ala Pro Glu Thr 
85 " 90 95 

Phe Gin Gly Leu Val Lys Leu His Ala Leu Tyr Leu Tyr Lys Cys Gly 
100 105 HO 

Leu Ser Ala. Leu Pro Ala Gly He Phe Gly Gly Leu His Ser Leu Gin 
115 120 125 

Tyr Leu Tyr Leu Gin Asp Asn His He Glu Tyr Leu Gin Asp Asp He 
130 " 135 140 

Phe Val Asp Leu Val Asn Leu Ser His Leu Phe Leu His Gly Asn Lys 
145 150 155' 160 

Leu Trp Ser Leu Gly Gin Gly He Phe Arg Gly Leu Val Asn Leu Asp 
165 170 175 

Arg Leu Leu Leu His Glu Asn Gin Leu Gin Trp Val His His Lys Ala 
180 185 190 

Phe His Asp Leu His Arg Leu Thr Thr Leu Phe Leu Phe Asn Asn Ser 
195 200 205 

Leu Thr Glu Leu Gin Gly Asp Cys Leu Ala Pro Leu Val Ala Leu Glu 
210 215 220 

Phe Leu Arg Leu Asn Gly Asn Ala Trp Asp Cys Gly Cys Arg Ala Arg 
225 230 235 240 

Ser Leu Trp Glu Trp Leu Arg Arg Phe Arg Gly Ser Ser Ser Ala Val 
245 250 255 

Pro Cys Ala Thr Pro Glu Leu Arg Gin Gly Gin Asp Leu Lys Leu Leu 
260 265 270 

Arg Val Glu Asp Phe Arg Asn Cys Thr Gly Pro Val Ser Pro His Gin 
275 280 285 

He Lys Ser His Thr Leu Thr Thr Ser Asp Arg Ala Ala Arg Lys Glu 
290 295 300 

His His Pro Ser His Gly Ala Ser Arg Asp Lys Gly His Pro His Gly 
305 310 315 320 

His Pro Pro Gly Ser Arg Ser Gly Tyr Lys Lys Ala Gly Lys Asn Cys 
325 330 335 

Thr Ser His Arg Asn Arg Asn Gin He Ser Lys Val Ser Ser Gly Lys 
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340 345 350 

Glu Leu Thr Glu Leu Gin Asp Tyr Ala Pro Asp Tyr Gin His Lys Phe 
355 360 365 

Ser Phe Asp He Met Pro Thr Ala Arg Pro Lys Arg Lys Gly Lys Cys 
370 375 380 

Ala Arg Arg Thr Pro He Arg Ala Pro Ser Gly Val Gin Gin Ala Ser 
385 390 395 400 

Ser Gly Thr Ala Leu Gly Ala Pro Leu Leu Ala Trp Xle Leu Gly Leu 
405 410 415 



Ala Val Thr Leu Arg 
420 



<210> 10 
<211> 17 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Consensus 
sequence 

<220> 

<221> MOD_RES 

<222> (3) . . (4) 

<223> Variable amino acid 

<220> 

<221> MOD_RES 
<222> (6) 

<223> Variable amino acid 
<220> 

<221> MODJRES 

<222> (9) . . (10) 

<223> Variable amino acid 

<220> 

<221> MOD_RES 

<222> (12) . . (13) 

<223> Variable amino acid 

<220> 

<221> MOD_RES 
<222> (15) 

<223> Variable amino acid 
<400> 10 

Cys Pro Xaa Xaa Cys Xaa Cys Tyr Xaa Xaa Pro Xaa Xaa Thr Xaa Ser 
15 10 15 

Cys 



<210> 11 
<211> 50 
<212> PRT 

<213> Artificial Sequence 
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<220> 

<223> Description of Artificial Sequence: Consensus 
sequence 

<220> 

<221> MOD_RES 
<222> (2) 

<223> Variable amino acid 
<220> 

<221> MOD_RES 
<222> (4) 

<223> Variable amino acid 
<220> 

<221> MOD_RES 
<222> (6) 

<223> Variable amino acid 
<220> 

<221> MOD_RES 
<222> (11) 

<223> Variable amino acid 
<220> 

<221> MOD__RES 
<222> (14) 

<223> Variable amino acid 
<220> 

<221> MOD_RES . 

<222> (16) . . (19) 

<223> Variable amino acid 

<220> 

<221> MOD_RES 
<222> (21) 

<223> Variable amino acid 
<220> 

<221> MOD_RES 
<222> (25) 

<223> Variable amino acid 
<220> 

<221> MODJRES 
<222> (27) 

<223> Variable amino acid 
<220> 

<221> MOD_RES 

<222> (29) . . (30) 

<223> Variable amino acid 

<220> 

<221> MOD_RES 

<222> (32) . . (35) 

<223> Variable amino acid 

<220> 

<221> MOD_RES 
<222> (37) 

<223> Variable amino acid 



18 



WO 02/29059 



PCT/US01/31488 



<220> 

<221> MOD_RES 

<222> (40) . . (41) 

<223> Variable amino acid 

<220> 

<221> MOD_RES 

<222> (43) . . (45) 

<223> Variable amino acid 

<220> 

<221> MOD_RES 

<222> (47) . . (49) 

<223> Variable amino acid 



Xaa Cys Xaa Cys Arg Ala Arg Xaa Leu Trp Xaa Trp Xaa 
5 10 15 

Arg Xaa Ser Ser Ser Xaa Val Xaa Cys Xaa Xaa Pro Xaa 
20 25 30 

Xaa Xaa Xaa Gly Xaa Asp Leu Xaa Xaa Leu Xaa Xaa Xaa Asp Xaa Xaa 
35 40 45 

Xaa Cys 
50 



<210> 12 
<211> 196 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Consensus 
sequence 

<220> 

<221> MOD_RES 
<222> (2) 

<223> Variable amino acid 
<220> 

<221> MOD_RES 

<222> (5) .. (6) 

<223> Variable amino acid 

<220> 

<221> MOD_RES 
<222> (8) 

<223> Variable amino acid 
<220> 

<221> MOD_RES 

<222> (10) . . (16) 

<223> Variable amino acid 

<220> 

<221> MOD_RES 

<222> (18) . . (25) 

<223> Variable amino acid 

<220> 



<400> 11 
Asn Xaa Trp 
1 

Xaa Xaa Xaa 
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<221> MOD_RES 

<222> (28).. (29) 

<223> Variable amino acid 

<220> 

<221> MODJRES 

<222> (32).. (35) 

<223> Variable amino acid 

<220> 

<221> MOD_RES 

<222> (37) . . (40) 

<223> Variable amino acid 

<220> 

<221> MOD_RES 

<222> (42) . , (46) 

<223> Variable amino acid 

<220> 

<221> MOD_RES 
<222> (49) 

<223> Variable amino acid 
<220> 

<221> MOD_RES 
<222> (53) 

<223> Variable amino acid 
<220> 

<221> MOD_RES 

<222> (56) . . (57) 

<223> Variable amino acid 

<220> 

<221> MOD_RES 

<222> (60) . . (62) 

<223> Variable amino acid 

<220> 

<221> MOD_RES 

<222> (64) 

<223> Variable amino acid 
<22 0> 

<221> MOD_RES 

<222> (67) 

<223> Variable amino acid 
<220> 

<221> MOD_RES 

<222> (70) . . (71) 

<223> Variable amino acid 

<220> 

<221> MOD_RES 
<222> (73) 

<223> Variable amino acid 
<220> 

<221> MOD_RES 
<222> (75) 

<223> Variable amino acid 
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<220> 

<221> MODJRES 

<222> (777. . (78) 

<223> Variable amino acid 

<220> 

<221> MOD_RES 
<222> (80) 

<223> Variable amino acid 
<220> 

<221> MOD_RES 

<222> (82) . . (83) 

<223> Variable amino acid 

<220> 

<221> MOD_RES 

<222> (85) . . (88) 

<223> Variable amino acid 

<220> 

<221> MOD_RES 
<222> (90) 

<223> Variable amino acid 
<220> 

<221> MOD_RES 

<222> (93) . . (94) 

<223> Variable amino acid 

<220> 

<221> MOD_RES 
<222> (102) 

<223> Variable amino acid 
<220> 

<221> MOD_RES 

<222> (104).. (107) 

<223> Variable amino acid 

<220> 

<221> M0D_RES 
<222> (109) 

<223> Variable amino acid 
<220> 

<221> MOD_RES 
<222> (112) 

<223> Variable amino acid 
<220> 

<221> MOD_RES 
<222> (114) 

<223> Variable amino acid 
<220> 

<221> MOD_RES 
<222> (117) 

<223> Variable amino acid • 
<220> 

<221> MOD_RES 
<222> (120) 

<223> Variable amino acid 
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<220> 

<221> MOD_RES 

<222> (128) . . (136) 

<223> Variable amino acid 



<220> 

<221> MOD_RES 

<222> (141).. (142) 

<223> Variable amino acid 



<220> 

<221> MOD_RES 
<222> (150) 

<223> Variable amino acid 



<220> 

<221> MOD_RES 

<222> (152) . . (155) 

<223> Variable amino acid 



<220> 

<221> MOD_RES 

<222> (158) . . (159) 

<223> Variable amino acid 

<220> 

<221> MOD_RES 

<222> (162).. (163) 

<223> Variable amino acid 



<220> 

<221> MOD_RES 
<222> (165) 

<223> Variable amino acid 



<220> 

<221> MOD_RES 

<222> (168).. (169) 

<223> Variable amino acid 



<220> 

<221> MOD_RES 
<222> (171) 

<223> Variable amino acid 



<220> 

<221> MOD_RES 
<222> (174) 

<223> Variable amino acid 



<220> 

<221> MOD_RES 
<222> (176) 

<223> Variable amino acid 



<220> 

<221> MOD_RES 

<222> (178) . . (179) 

<223> Variable amino acid 



<220> 

<221> MOD_RES 

<222> (181) . . (184) 

<223> Variable amino acid 
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<220> 

<221> MOD_RES 
<222> (187) 

<223> Variable amino acid 
<220> 

<221> MOD_RES 

<222> (189) . . (190) 

<223> Variable amino acid 

<220> 

<221> MOD_RES 

<222> (192) . . (193) 

<223> Variable amino acid 

<400> 12 

Arg Xaa Phe Leu Xaa Xaa Asn Xaa lie Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
1 5 10 15 

Phe Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Leu Trp Xaa Xaa Ser Asn Xaa 
20 25 30 

Xaa Xaa Xaa lie Xaa Xaa Xaa Xaa Phe Xaa Xaa Xaa Xaa Xaa Leu Glu 
35 40 45 

Xaa Leu Asp Leu Xaa Asp Asn Xaa Xaa Leu Arg Xaa Xaa Xaa Pro Xaa 
50 55 ' 60 

Thr Phe Xaa Gly Leu Xaa Xaa Leu Xaa Leu Xaa' Leu Xaa Xaa Cys Xaa 
65 70 75 80 

Leu Xaa Xaa Leu Xaa Xaa Xaa Xaa Phe Xaa Gly Leu Xaa Xaa Leu Gin 
85 90 95 

Tyr Leu Tyr Leu Gin Xaa Asn Xaa Xaa Xaa Xaa Leu Xaa Asp Asp Xaa 
100 105 HO 

Phe Xaa Asp Leu Xaa Asn Leu Xaa His Leu Phe Leu His Gly Asn Xaa 
115 120 125 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Phe Arg Gly Leu Xaa Xaa Leu Asp 
130 135 140 

Arg Leu Leu Leu His Xaa Asn Xaa Xaa Xaa Xaa Val His Xaa Xaa Ala 
145 150 155 160 

Phe Xaa Xaa Leu Xaa Arg Leu Xaa Xaa Leu Xaa Leu Phe Xaa Asn Xaa 
165 " 170 175 

Leu Xaa Xaa Leu Xaa Xaa Xaa Xaa Leu Ala Xaa Leu Xaa Xaa Leu Xaa 
180 185 190 

Xaa Leu Arg Leu 
195 



<210> 13 

<211> 1176 

<212> DNA 

<213> Homo sapiens 

<400> 13 

gagggcatcc ccgtggacag cgagcgcgtc ttcctgcaga acaaccgcat cggcctcctc 60 
cagcccggcc acttcagccc cgccatggtc accctgtgga tctactcgaa caacatcacc 120 
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tacatccacc ccagcacctt cgagggcttc gtgcacctgg aggagctgga cctcggcgac 180 

aaccggcagc tgcggacgct ggcacccgag accttccagg gcctggtgaa gcttcacgcc 240 

ctctacctct acaagtgtgg gctcagcgcc ttgccggccg gcgtctttgg cggcctgcac 300 

agcctgcagt acctctacct gcaggacaac cacatcgagt acctccagga cgacatcttc 360 

gtggacctgg tcaacctcag ccacctgttt ctccacggca acaagctgtg gagtctgggc 420 

ccgggcacct tccggggcct ggtgaacctg gaccgtcttt tgctgcacga gaaccagctg 480 

cagtgggtcc accacaaggc attccacgac ctccgcaggc tgaccaccct cttcctcttc 540 

aacaacagcc tctcggagct gcagggtgag tgcctggccc cgctgggggc cctggagttc 600 

ctccgcctca acggcaaccc ctgggactgt ggttgtcgcg cgcgctccct gtgggaatgg 660 

ctgcagaggt tccggggctc cagctccgct gtcccctgtg tgtcccctgg gctgcggcac 720 

ggccaggacc tgaagctgct gagggccgag gacttccgga actgcacggg accagcgtcc 780 

ccgcaccaga tcaagtcaca cacgctcacc accaccgaca gggccgcccg caaggaacac 840 

cactcacccc acggccccac caggagcaag ggccacccgc acggcccccg gcccggccac 900 

aggaagccgg ggaagaactg caccaacccc aggaaccgca atcagatctc taaggcgggc 960 

gccgggaaac aggcccccga gctgccagac tatgccccag actaccagca caagttcagt 1020 
tttgacatca tgcctacggc ccggcccaag aggaagggca agtgtgcccg caggaccccc 1080 

atccgtgccc ccagcggggt gcagcaggcc tcctcggcca gttccctggg ggcctccctc 1140 
ctggcctgga cactggggct ggcggtcact ctccgc 1176. 

<210> 14 

<211> 392 

<212> PRT 

<213> Homo sapiens 

<400> 14 

Glu Gly He Pro Val Asp Ser Glu Arg Val Phe ■ Leu Gin Asn Asn Arg 
1 5 10 15 

He Gly Leu Leu Gin Pro Gly His Phe Ser Pro Ala Met Val Thr Leu 
20 25 30 

Trp He Tyr Ser Asn Asn He Thr Tyr lie His Pro Ser Thr Phe Glu 
35 40 45 

Gly Phe Val His Leu Glu Glu Leu Asp Leu Gly Asp Asn Arg Gin Leu 
50 55 60 

Arg Thr Leu Ala Pro Glu Thr Phe Gin Gly Leu Val Lys Leu His Ala 
65 70 75 80 

Leu Tyr Leu Tyr Lys Cys Gly Leu Ser Ala Leu Pro Ala Gly Val Phe 
85 90 _ 95 

Gly Gly Leu His Ser Leu Gin Tyr Leu Tyr Leu Gin Asp Asn His He 
100 105 HO 

Glu Tyr Leu Gin Asp Asp He Phe Val Asp Leu Val Asn Leu Ser His 
115 * 120 125 

Leu Phe Leu His Gly Asn Lys Leu Trp Ser Leu Gly Pro Gly Thr Phe 
130 ** 135 140 

Arg Gly Leu Val Asn Leu Asp Arg Leu Leu Leu His Glu Asn Gin Leu 
145 150 155 160 

Gin Trp Val His His Lys Ala Phe His Asp Leu Arg Arg Leu Thr Thr 
165 170 175 

Leu Phe Leu Phe Asn Asn Ser Leu Ser Glu Leu Gin Gly Glu Cys Leu 
180 185 190 

Ala Pro Leu Gly Ala Leu Glu Phe Leu Arg Leu Asn Gly Asn Pro Trp 
195 200 205 
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Asp Cys Gly Cys Arg Ala Arg Ser Leu Trp Glu Trp Leu Gin Arg Phe 
210 215 220 

Arq Gly Ser Ser Ser Ala Val Pro Cys Val Ser Pro Gly Leu Arg His 
225 230 235 240 

Gly Gin Asp Leu Lys Leu Leu Arg Ala Glu Asp Phe Arg Asn Cys Thr 
245 250 255 

Gly Pro Ala Ser Pro His Gin He Lys Ser His Thr Leu Thr Thr Thr 
260 265 270 

Asp Arg Ala Ala Arg Lys Glu His His Ser Pro His Gly Pro Thr Arg 
275 280 285 

Ser Lys Gly His Pro His Gly Pro Arg Pro Gly His Arg Lys Pro Gly 
290 295 300 

Lys Asn Cys Thr Asn Pro Arg Asn Arg Asn Gin He Ser Lys Ala Gly 
305 310 315 320 

Ala Gly Lys Gin Ala Pro Glu Leu Pro Asp Tyr Ala Pro Asp Tyr Gin 
325 330 335 

His Lys Phe Ser Phe Asp lie Met Pro Thr Ala Arg Pro Lys Arg Lys 
340 345 350 

Gly Lys Cys Ala Arg Arg Thr Pro He Arg Ala Pro Ser Gly Val Gin 
355 360 365 

Gin Ala Ser Ser Ala Ser Ser Leu Gly Ala Ser Leu Leu Ala Trp Thr 
370 375 380 

Leu Gly Leu Ala Val Thr Leu Arg 
385 390 



<210> 15 
<211> 143899 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> modified_base 

<222> (2044) . . (2144) 

<223> a, t, c, g, other or unknown 

<220> 

<221> modified_base 
<222> (6609) 

<223> a, t, c, g, other or unknown 
<220> 

<221> modified_base 

<222> (6625) . . (6724) 

<223> a, t, c, g, other or unknown 

<220> 

<221> modified_base 

<222> (14153) . . (14252) 

<223> a, t, c, g, other or unknown 

<220> 

<221> modified base 
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<222> (19512) . . (19611) 

<223> a, t, c, g, other or unknown 



<220> 

<221> modifiedjaase 

<222> (22595) . . (22694) 

<223> a, t, c, g, other or unknown 

<220> 

<221> modified_base 

<222> (27825) . . (27924) 

<223> a, t, c, g, other or unknown 



<220> 

<221> modified_base 

<222> (34953) . . (35052) 

<223> a, t, c, g, other or unknown 

<220> 

<221> modified_base 

<222> (40783) . . (40882) 

<223> a, t, c, g, other or unknown 

<220> 

<221> modified_base 

<222> (49000) . . (49099) 

<223> a f t, c, g, other or unknown 

<220> 

<221> modified_base 

<222> (62884) . . (62983) 

<223> a, t, c, g, other or unknown 

<220> 

<221> modified_base 

<222> (75528).. (75627) 

<223> a, t, c, g, other or unknown 

<220> 

<221> modified_base 

<222> (87944) . . (88043) 

<223> a, t' r c, g, other or unknown 

<220> 

<221> modified^base 

<222> (111030) . . (111129) 

<223> a, t, c, g, other or unknown 



<400> 15 

aagcacatac 

ataccatcct 

tacagggaac 

tggacgcagg 

agcccaagtg 

ccctgagcca 

tcaatgccgg 

gaaggtgggc 

gatcaagctg 

tgtgaccacc 

tgccttcatg 

tctggtcaag 

tcatggttga 

ggggcatgct 

gattcagggt 



aggtgacatt 
caaacagctg 
aaagtgactt 
ctgtctgact 
aaaaacagct 
agctctgagc 
gtaaaggagt 
tctgtgtgac 
agtgggctgg 
atgggccaga 
tcaccccagg 
gtgccggcct 
gtatttattc 
cctcatttac 
taccctgctg 



acagaactga 
tatgatgtag 
tccaaaggtc 
ctggagtctg 
gatccactct 
aacagaaaca 
gctggaaacc 
tggttcctct 
ccctggacca 
ggctcagcca 
atgttcgggg 
tgggggatag 
gcacccctct 
agatgggcac 
gattcctggg 



cagttatgcc 
attgggtatt 
atgccattca 
aggtgttaat 
tcatttactt 
gaagctctgc 
gctgggatgg 
acactctggc 
caaggtgaca 
gaaggccccc 
cacctcagaa 
tgatggtggg 
tcagtcctta 
atcaaagctc 
attggggagg 



aggcactgta 
aaccccatta 
aaggagggtg 
gctgcctcct 
ggcatctgtg 
attaggagct 
ccgccgagca 
ctggctgcct 
ggtgacctct 
agcctggagg 
aaccttcctg 
tcctcatatt 
caacacccca 
agctaacgct 
gaggagcttc 



cttagcccct 
ataacaaaag 
aatcttaggt 
ccatgggaac 
ctaagctggt 
tgtgagcatg 
ctaggccgtt 
gcaggaagaa 
tctacaccca 
acggttcgga 
agagtgagtg 
cagtgagcac 
tgatgtaggt 
gggaagttca 
caaaatgggg 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
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acaaggtctc tgggcctgtc gggtagctgg tttcctcagg gccccttgca acctctgagc 960 

ttattgcatc aggtgcagcc aggcccgtga gcctcctggc aggggtcctc cacacctggc 1020 

tgtcttttgc cccctgctgg tcacaggagg agctgcagca cctgcctggg ctgcttctca 1080 

ggagggtaca tgaagatccc aggaccgcca gctccatgat aagtggaagg agctccttgg 1140 

agtcaggagc gggagttgag gagtttgagt cctgctctcc agttataggc tatgtgactt 1200 

gtgtagatca cctaaccttg ctcttgattt ccttacctct taaactagca ctaaaagcac 1260 

cccacaaact gtaaagttag ttgtgatgat tgaatgacac catgggtgtg gaagctcttt 1320 

gtaaagtgca aaacggtgtg cagtttgagg gtggttaccc ccagtgccga ttctcagagg 1380 

gcaacatggc taagggcacg agctggagtt aggctgacct gctgcttcca gccctgtgag 1440 

cttgagcaag tcatttaact tcctgagctg cagtttcctc atcagtaaaa tgtgataagg 1500 

atagggttgt tgtaagattt tattaaatgg ggtaataaat gtcaagtatg tagcccatag 1560 

tgagtgcttc agagtttttt tcttttgttt ctttcccccc cgccccgaga tggagcctta 1620 

ctctgttgcc caggctggag tgcagtggca tgatcttggc tcactgcaac ctccgcctcc 1680 

cgggttcaag caattctcct gcctcagcct cccaaatagc tgggactaca ggcgtgcacc 1740 

accatgctcg gctaattttt gtatctttag tagagacggg gtttcaccat gttggccagg 1800 

ctggtctcga actcctgacc tcatgatgct cctgcctcag cccccgaaag ttttgggatt 1860 

acaagtgtga gcccccgtgc cctgccaggt tttttttttt tttttttttt tgtaaaacac 1920 

ccacagggta ttgctgttgc ctgggctgga gtgcggtagt gcaatcatag ttcactgcag 1980 

ccttgacctc ctgggctcaa gtgatcctcc tgcctcagcc tcctgagtag ctgggaatac 2040 

aggnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 2100 

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnttttgta ttttaagtag 2160 

agacagggtt ttcccaatgt tggccaaggc tggtctaaaa ctcccaacct caggtgatcc 2220 

acccacctca gcctcccaaa gtactgggat tacaggcgtg agacaccgtg cccagccagg 2280 

aggcttattt tcttgataaa ttacccagtc tcaggtattt ctctacagcg atgcaagaac 2340 

agcctaatac atccaggctc agcatcagtg gacccaggtg ggagagctta agatgtcaag 2400 

gtctgaatgc cgcttccaca cacctttggg acctagggac tccctctctt tttctttttt 2460 

cagtagaaga tgttatcttc tcctttctct gaccagtagt- tggtgatggt ttcagagata 2520 

gtttttcagt caagatatat ttcagtggct tcactgagcc caagttccct cgcctctcta 2580 

ggactttatt tccttgtttc tagaagaggg ataacacata ttttctaagg tggttgtgag 2640 

attaagggag ctggtaccgg gtggtgcata aggacaggat agagcaatgg tgagaccact 2700 

caaaaagcga aaagttgacc tgcgagggtg acacttatca aatcagcaca cagtgggagt 2760 

ggaaggaatg tccctcatca gttacaatat ttggagagtg caagttatag aaaacccagc 2820 

cctggccggg cgcggtgggt catgcctata atcccagcac tttgggaggc tgaggcaggt 2880 

ggatcacgag gtcaggagtt caagaccagc ctgaccaacg tggtgaaacc ccacctctac 2940 

taaaaataca aaattagctg ggcgtggtgg tgtgtgcctg taatctgagc tactcaggag 3000 

gctgaggcac gagaatcact tgaaaccggg aggtggagtt tgcagtgagc cgagatcgca 3060 

ccactgcact ccagcctggg caacagagcg agactccatc tcaaacgaaa aaaaaaaaag 3120 

aaagaaaacc cagctctaac tggcttaaac agtaagaaga tctattatat tatccatctc 3180 

aggcagcagc aagcccagag gtaggggact ccaaggttgg ttgatccagg gcttaacgat 3240 

gtcatcaaag acccaggttc tttctgtctc ggcacctctg tctgcagggc cagcttcatc 3300 

ctaagccaga ttgttcttgt cttgattaca agttggctgc tgggccagca gacgctgcct 3360 

gcctccctgt tcatcttcag aagtagaaag tggcccttcc ccagtcatgg aatgaaagag 3420 

tttcctttct gtctgggatt gcttaggtcc acccacctga agccaatgac tgtcaccagg 3480 

aaggtaatat acactgattg tcttaagtca gggttcctga gccagtcttg ggcaaggagt 3540 

gtgatactgt catgattgtc ttgggctcat cagggcagct ctgcagatga gatcaaactc 3600 

caagctacat tattctgaac agtgggaagt aggaaagaga cattttggga gatacaaaac 3660 

acaatgtcta tcccatatcc ctaggtccag gtcacagtgt cttggttgga catcaaatgt 3720 

agaaaaagaa agactgtcca tccatttatc tacctattca tctggttttt gatttttttt 3780 

aaattttatt ttaagacatt ctcactctgt cacccagact ggagtgcagt ggtttgatca 3840 

tggctcatgg cagcctcaac ctcccaggct caagtgaccc tcccatgctc aagtgatcct 3900 

cctacctcag cctcccaagt agctagaact aaaggtgcat gccaccacgc tcagttaatt 3960 

tttgcatttt ttgtagagat ggggtttcgt catgatgccc atgctagtct ggaattcctg 4 020 

aactcaagca atatgcctgc ctttgcctcc caaaatgctg ggattgtagg catgagccac 4080 

tgctcctggc tcatctgttt aataatttat gaaacaacta ctgggtgctg agcacggggc 4140 

caggggctgg agatctagca gggaccaggc agatctctgc caagtcgttg gtttcttaaa 4200 

ggttttgctc ataattcccc ttttcttttc tctttcgttt tttttctttt ctttctttct 4260 

ttctttcttt tttttttttt gagacagagt ctcactctgt tacccaggct ggagtgcagt 4320 

ggtgcgatct cagctcactg caacctctgc ctcctgggtt caagcgattc tcctgcctca 4380 

gcctcccgag tagctgggac tacaggcgcc tgccaccatg cccggctaat ttttgtgttt 4440 

ttagtagaga ctgggtttca ccatattggc caggctggtc ttgaactcct gaccttgtga 4500 

tccgcccgct tcggcctccc acagtgctgg gattacaggc gtgagccacg gcgcccagcc 4560 

agtttccctt ttcaatgagg cctccctgac ctccatactc tactcctcca cctggcccac 4620 

tcagctctac tttttcttcc ccatagcact caagacctcc taacatacta cgtaagttat 4680 

ttatttacta ggcttactgt gtattgtctg tcttcctcta ctagaatgta aactccatga 4740 
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gaatagaaat 

gcatacaaca 

tattgttatg 

tagagtttaa 

ttcagtcatg 

tttatttcct 

agggagctgg 

tgacaaagcg 

tttgggctgg 

cacttgactt 

ctggtcaaca 

tgccttattt 

tgtttctgat 

tgacattttg 

gctgaaacat 

ttatatgttt 

aatagcacat 

tactggttta 

ttctgtcacc 

tgggttcaag 

atgacaccct 

aggctggtct 

ggattacagg 

tatacttata 

ccagaagaca 

cttctagtgt 

taatgtgtgt 

caaaaatttt 

cattcgggga 

caacatggtg 

cggctatagt 

tgaggctgna 

nnnnnnnnnn 

nnnnatattc 

aggctggagt 

gattctcctg 

tatttttagt 

aggtgatcca 

cctggcctcc 

atatatatat 

gatcaatagg 

tgatgagggt 

tttttttttt 

gcatgatctc 

cctcctgagt 

tcagtagaga 

gatccgccca 

cctgcccatc 

gatttgtgtt 

tggtatatct 

tttctattac 

ttgtaaaaat 

gcacaacagt 

ttggtgtcat 

ttccttcttt 

ctccctctct 

ttccttcctt 

tctgtcaccc 

gggttcaagc 

ccatgcccag 

tggtttcgat 

caggtgtgag 

tagttagatc 

tgttgtaaat 



ttttgccttt 

gtcactccac 

agggagtgtg 

gttttctttt 

acgtatttca 

tatttctaga 

tatggggtga 

tcagagacca 

gcacaatgac 

gcagaaggtc 

gagtgagacc 

ggtctattcc 

tggttaagct 

gtcaaggaca 

tcctattgtc 

gtggcaatgc 

acagttatat 

tatatttact 

caggctggag 

tgattctcct 

gctaatatgt 

tgaactactg 

cgtgagccac 

aaaaaaagct 

ttgttatcat 

ggaagtggaa 

gtttgtgtct 

taaatagaaa 

ggccaaggtg 

aaaccccatc 

tccagctaat 

gtgagctgtg 

nnnnnnnnnn 

ttaaaaaaat 

gcaatggcgc 

ccttagcctc 

agagatgggg 

cccccctcgg 

tttacatttt 

ttttaagact 

tgtaaacacc 

tcactttctt 

tttttttttg 

agctcactgc 

agatgggatt 

tggggtttca 

cctcggcctc 

ttttttttgt 

tccctactga 

ttggagaaag 

tgagatgtaa 

tttccttcca 

ttttaatatt 

gtttaaggaa 

ccttccttcc 

ccctccctcc 

ccttccttcc 

aggctggagt 

aattctcctg 

ctaatttttg 

ctcctgacct 

ccaccatgcc 

cttgatccat 

tgcccagaag 



ttatttagtg 

ccacagttgc 

catgttgtta 

ttaagcaatc 

atggcttcac 

acggggataa 

tgcataagga 

gtatttacga 

tcacaactaa 

aaggctgcag 

ctggctcgaa 

atttgaaagt 

taagttttgt 

gaccacatat 

tatggcgtag 

tggtgtaaac 

atagtacata 

atattattta 

tgcagtggcg 

gcctcagtct 

tttgtatttt 

acctcaagtg 

cacgcatggc 

aactgtcaaa 

aggagatgac 

gacagtgata 

ttgcttttaa 

aaagctgccc 

ggtggattgc 

tctacaaaaa 

cgaggggctg 

attgnnnnnn 

nnnnnnnnnn 

ttttttttat 

tatctcagct 

ccaggtacag 

tttcaccatg 

cctcccaaag 

tttaaattta 

agccaagtga 

actgcactgg 

tgtagcctca 

agatggagtc 

aacctccgcc 

acaggcgtcc 

ccatgttggc 

ccaaagtgct 

tatagccatc 

tcaatgatgt 

gtctattcag 

gagttcttta 

ttattgggtt 

gaagtccaat 

ctattgccta 

ttccttcctt 

ctccttccct 

ttccttcctt 

gtagtggcac 

cctcagcctc 

tatttttagt 

cgtgatccac 

cggcctgtgt 

tttgagttga 

tgggtatgct 



tggtgtctgc 

tgaataagtg 

gagaaaaatc 

cacgaattgg 

tgagcccaag 

cacatagttc 

caggatagag 

cggaaatgtg 

taaaactttg 

tgagctgtga 

acgttatttg 

tcctagttct 

tttcctttaa 

acagtggtgg 

tggtcctgat 

acacctactg 

atatctgata 

tctttatttt 

cgatcttggc 

cctgagtagc 

tagtagagat 

atcaccccgc 

ctatttataa 

cagcctcggg 

agctccgtgc 

ttgatgatag 

caagaaagtt 

aggaacaatg 

ttgagctcag 

atacaaaaat 

aggtgggagg 

nnnnnnnnnn 

nnnnnnnnnn 

ttttgagaca 

cagggcaacc 

gcgcccgcca 

ttgtccaggc 

tgctggaatt 

attttaattt 

agcagtggga 

gaccagccta 

acaatatgtg 

tcactctgtt 

tcccaggttc 

accaccacgc 

caggttggtt 

gggattacag 

ctagtggatg 

tgagcatctt 

gtcctttgcc 

tgttctagat 

gtctttcact 

tttctatttt 

atctcaggtc 

ccttctttcc 

tcctccctcc 

ccttcctttg 

gatcttggct 

ctgagtagct 

agagacgggg 

ccgccttggc 

tttcttagag 

ttttgtatat 

tctaaatctg 



agcccctggc 

actaaaggtc 

tgaggcacaa 

ggtagtttca 

ttctttcacc 

ataaggcagt 

cagtggtgag 

gaagcatgat 

ggaggccaag 

ttttgccact 

attggttaca 

ataattttaa 

tacagttaag 

tcccataaga 

gttgtagcgc 

cactgccagt 

atgataatac 

atttttgaga 

tcaccgcaac 

tgggattaca 

ggggtttcac 

ctcggcttcc 

ttattttaga 

caggtccttc 

atattattgt 

gacccagtgt 

taaaaagtta 

gctcacacct 

gagttcaaga 

tagccgggtg 

atcactgggg 

nnnnnnnnnn 

nnnnnnnnnn 

gaatttctct 

tccacctcct 

ccatgctcgg 

tggtcttgaa 

tacaggcgtg 

tttaattttt 

gtggaaaagg 

ttttacattc 

tcgttgccca 

gcctaggctg 

aagtgattct 

ccggctaatt 

tcgaactcct 

gcatgagcca 

taaagttttt 

ttcctgtgct 

cactttaaaa 

ataagtctcc 

ttcttttggt 

tctcttttgc 

acaaagattt 

ctccctccct 

ctccctcctt 

tccttctgac 

cactgcaacc 

gggactacag 

tttcaccaca 

ctcccaaagt 

ttttgtagtt 

agtgtgagat 

gctgttaggg 



ttagtccctg 

cctgccctca 

taaaatttta 

gaggtagttt 

tctctaggac 

tatgagagta 

accgctcaga 

aaagaaatta 

gtgggaggat 

gcactccagc 

gttatacagt 

gtttgttggc 

tgccccataa 

ttataatgga 

aatgcattag 

gatataaaag 

ataactatat 

cagagtctca 

ctccgcttcc 

ggtgtgcacc 

catgttggcc 

caaagtgctg 

gtgtacgcct 

aacagatatt 

ccctgaaaac 

aggcctaggc 

aaataaaata 

gtaatcccac 

ccagcctggg 

tggtggcatg 

gggaggtggt 
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nnnnnnnnnn 

cttgttgccc 

gggttcaagc 

ctaatttttg 

atcctgcctc 

agccactgtg 

aatttctcat 

aactggtttt 

ctgttagcag 

tctttttttt 

gaatgcaatg 

tgtgtctcag 

ttttgtattt 

gacctcaagt 

ccgcgcccgg 

ttgtgatttt 

tattggcttt 

ttaggttatc 

tacatatgat 

gtcctttagt 

cacttgtatc 

acacctgtgt 

ctctccctcc - 

ccttccttcc 

ggaatcttgc 

tctgcctcct 

gcacacacca 

ttggccagga 

gctgggattg 

ttagctctta 

atccacctgg 

attactagag 
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gtgaccaaag 

ccaggctgga 

gcagttctcc 

ggctaatttt 

actcccggcc 

gagccaccgt 

tttgagatag 

agccttgaac 

acaggcatgt 

cttgcaaagc 

cagccaaaat 

tcatgctgca 

gtcagtcttc 

tagtgtgtgg 

aattttcttt 

gctctgttgc 

ccaggttcaa 

caccatgccc 

gctggtctcg 

attacaggcg 

atggtaaaat 

taagcagata 

ctgttatttg 

caaagtggac 

catttaacaa 

ttattatttt 

acagctcaat 

gtagctggga 

ttttttgaga 

actgcaacct 

ggattacagg 

gtttcaccat 

ggcctcctaa 

ttattttttg 

tcaagtgatc 

gcctggcctg 

acatcctctg 

taggcagaag 

aaacaattgc 

tacaaatctg 

agctcagaga 

aaagatttag 

ggatgtccct 

caccagcgag 

tgtcaccaga 

ctcaagcaat 

cacctgacta 

gtatcacttt 

acctcccagg 

tgcgccacta 

ccagggctag 

ccgggattac 

catgtcagag 

acaaagaaaa 

gagccagcct 

ctcagagagt 

tccaggtgct 

catttccctt 

taaaaaaaaa 

ccacccccac 

acgtgtatcg 

ctctgtagcc 

tgggttcaag 

actacacccg 



tgaatttttt 

gtgcaatggc 

tgcctcagac 

tgtattttta 

tcaagtgatc 

gcccagtcct 

ggtctcactc 

ttctggactc 

gcttccacac 

agggtacaac 

ccttgaggct 

tgatcaaaag 

ctctgcagtg 

aaagaggcag 

ctatctttct 

ccaggctgga 

gcgattctcc 

ggctaatttg 

acctcctcac 

tgagccaccg 

gggactagta 

aaatttcaga 

ctatgtgacc 

aattgtcaac 

aaatatgaag 

agaaataggg 

gcatcttcag 

ccacaggcat 

cagtcttgct 

ccgcctcctg 

cctgtgccac 

gttgcccagg 

agtgctagga 

tagagacagg 

ttcccacctc 

gaactcatta 

acctacctat 

cagttcaatt 

taaagttctt 

tgatacaggt 

tgctaagtaa 

tttcacattg 

gaccttcagt 

agcccaatcc 

ctggagtgca 

cctcccacct 

attttaaaat 

gtcacccaag 

ttcaagtgat 

tgctcagcta 

tctcaaactc 

aggcataaac 

gcaggaaagg 

tgggatcatt 

ctcaggtggt 

ttctgcagca 

tcacaaccat 

tgccatagct 

tagcttcatt 

acgttataca 

ttttttgtta 

caggctggag 

cgattctcct 

gctaatgttt 



ctttgtttct 

ttcatcttgg 

tcctgagtag 

gtaaagatgc 

catctgcctc 

tttctcagaa 

tgtcagctag 

acgtgatctt 

ctggctaatt 

atgcctatct 

ctcgggctga 

tgtccgtttt 

tcccaagaac 

acttccaact 

ttcttttctt 

gtgcagtggt 

tgcctcagcc 

tgtaatttta 

ctcaggtgat 

tgcctggcac 

atccatacct 

ctacggatgg 

ttatcttctc 

ctttcttcca 

agctcaacaa 

tcttgttctc 

cctcctggat 

gtaccaccac 

ttgtcgccca 

ggttcaagtg 

catgcccggc 

ctgatctaga 

ttacaggcgt 

ttctcactat 

agcctcccaa 

ttgaagcatt 

aagggtatca 

tatcacaaac 

taggcattta 

attattttta 

cttgtgcaag 

gactccagaa 

ggcccagggc 

taagattttg 

gtggcatgat 

cagcctcctg 

tttatttaat 

ctggactgca 

cctcccacct 

atgtttttat 

ttggactcaa 

caccacaccc 

gaagcatcat 

ttgtgagtgt 

tgagaccaca 

cagctagtgg 

cccaaaccta 

agtgttattt 

ctaaagtgta 

tatacatgta 

ttttttttgt 

tgcagtggtg 

gcctcagcct 

gtatttttag 



tttttttttt 

ctcagtgcaa 

ctggtattac 

agtttcacct 

tacctcccaa 

tttatttgtt 

gcaggagttc 

cccacctcag 

ttttaatttt 

ctacctacct 

ctgtgggtgc 

ctgcttcttg 

acagaattag 

ccgctatgtg 

ttcttttttt 

gcgatctcac 

tcctgagtag 

gtagaaacag 

ccacccgcct 

ttaattttct 

tatagcgctg 

gattgttact 

tggatctcca 

aagagcaatg 

atgaggaact 

ttgcctaggc 

acaagtgatc 

gcacggctaa 

gactggagtg 

attctgctgc 

taattttttt 

acccctggcc 

gagcctctgc 

gttgccaggg 

agtgctgaga 

cactagtatc 

cagctaacgg 

tactctatat 

ctgtttgtta 

ttaaccccat 

gtcacacagc 

cctttctact 

tctcctgggg 

tttgtttgtt 

caatgctcac 

agtagcttgg 

taattactta 

atggtgtggt 

cagcctctgg 

tttttgtata 

gcgatcctcc 

aacccctaag 

ggggttagga 

tcgctgtgtg 

gtccccattt 

aattagaatt 

gtcatttgca 

acttaataat 

atattcttgg 

tgtttctaat 

tgttgttagt 

tgatttcggc 

ctggagtagc 

tagagacagg 



ggagacagag 

cctctgcctt 

aggcgtgtac 

gttggccagg 

agtgctggga 

tttttttgtt 

agtggtgtga 

cctcctgagt 

ctaggactta 

ctcttccctt 

tgttgcctga 

gaactttatt 

accaggaatc 

ctgttgggtg 

ttttttggag 

ctcactgcaa 

ctgggattac 

tgtttcacca 

tggcctccca 

taatacctca 

ttgtgaaaat 

acattctgaa 

ttctttccaa 

atttaaggat 

cattattatt 

tggagtccag 

ctcatgtctc 

ttttttattt 

cagcagcgca 

ctcaacctcc 

gtatttttgg 

tcaagtgatc 

acctggcctc 

ctggtcttga 

ttacagatgt 

aactttgggg 

agcctctgtt 

ccagcataag 

gttagatatt 

tttatagaag 

tagtaaataa 

gggactcatg 

gaatccagcc 

tttgagacaa 

tgcaaccttg 

actacaggtg 

ctattatttt 

ctcagctcat 

agttgcaggg 

gatggggtct 

tgtcttggcc 

gtgtttttgc 

aaggaacact 

tgtatgtgtg 

cccagatgag 

tgaacccggc 

gtttaccttc 

tccttttgaa 

aatatcgggt 

acatatatat 

tttttttaga 

tcactggaac 

tgggattaca 

gtttcaccat 



tctccgtcac 

ctggtttcaa 

caccatgctt 

cttttctgga 

ttacaggtgg 

ttgtttcatt 

tcattgctgc 

agctaggatt 

tttgtccatt 

caagggactc 

tctgcctcag 

cactttgggt 

tgtgttgcca 

attgaagctt 

atggaatctc 

cctccgcctc 

aggtgcatgc 

tattggtcag 

aagtgtcggg 

attaccccat 

gaaatgaggg 

cctggctttg 

gtctataaaa 

caaatgatgt 

attacaatta 

tggtataaac 

atccccctaa 

tttattttta 

atcaccgctc 

caagtagctg 

taaagacggg 

cccctttctt 

ggctaatttt 

actcctgggc 

gagccactgt 

ttacctggcc 

tctcagaatt 

tgcccaaata 

tagtcctcac 

agaaacctga 

agggcagagt 

ggaatagtgt 

atagacaaga 

ggtctcactc 

atctcccagg 

cacaccacca 

ttgagacagg 

tgcgtcctcc 

actgcaggtg 

cactatgttg 

tcccaaagtg 

tgaatgtgac 

gagcagggag 

acaattctca 

ataatggagc 

tcttccagac 

atgattttac 

tcagtctgct 

ttgctgttac 

gtacgtatat 

tggagtctct 

ctctgcctcc 

ggcacccacc 

gttggccagg 
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tgggtcttga 
ttataggtgc 
aataactatg 
acatgctgtg 
aatcactgtc 
atgccaaaga 
ttgggctggc 
aagccttcct 
gaatgaatgt 
gtggaagagc 
tggcccatgt 
cttgagtttg 
ggacccttac 
attaaatggg 
tgggattgtt 
gtctaattca 
ggtaaagcag 
gtatgacctt 
tgcctagatc 
tcacccaggc 
tcacgccatt 
gcccggctaa 
atggtctcga 
acaggcttga 
tggattttta 
ataagggtca 
gtgcagcctc 
acagtgctgt 
ataaacctgt 
nnnnnnnnnn 
nnnnnnnnnn 
gggtagatca 
ctctacaaaa 
cgggaggctg 
atcatgccat 
aaaaaaaaag 
attggttcct 
cctggcttgg 
tccaccacta 
tggccagtcc 
atagtgatca 
catttgtggg 
gggctgcctg 
ctgtactttt 
tgccgtggcg 
acctcagtct 
gtatttttag 
tcaggtgatc 
gcccggtcct 
ggcatagttc 
agctcttggc 
gtgcagcaaa 
atttcaattt 
agtgtcccct 
cagaccctgg 
ggagaggatg 
ttgccagccc 
cctcaactgg 
atcatctgcg 
cattaaactt 
catattaata 
gtagattttt 
ggctgtggca 
agcactgcac 



actcctgatc 
gagctactgc 
aaacctgaaa 
cataccacat 
ccatcctatg 
ccttttctgt 
tctcctttta 
tatgaatgaa 
atgaattgac 
acatgtcctt 
aacacagagg 
aatctgagcc 
catttataaa 
attctctata 
tgtgagggaa 
gggactgcaa 
ggcaggtgta 
gagaaggcaa 
ttctaacttt 
tggagtgcag 
ctcctgcctc 
tttttttttt 
tctcctgact 
gccaccgcac 
tgtgataact 
acattgccag 
tggtctaaaa 
tgtatagtaa 
agagaggccg 
nnnnnnnnnn 
nnnnnnnnnn 
cctgaggtca 
ataaaaaaat 
aggccggaga 
tgcactccag 
aaaaaagaaa 
gagagtagcc 
gccaatgaat 
gggaggtctc 
tcctacagtc 
aggccgttac 
tcttttggaa 
tccctgctcc 
tttttttttt 
tgatctctgc 
cctgagtagc 
tagagacagg 
cacccgcaca 
accaatctgc 
tgagaagcat 
agagacatct 
ttttgtctag 
ttaagaaatg 
taatttctta 
ggtgacagaa 
atctacagag 
tgaagctatg 
ggccattttt 
ggggtgggga 
tctacaacgc 
gcacaaagtt 
cacacattaa 
ggaggattac 
tccagcctgg 



tcaagtgatc 
ggctggccaa 
aactgctcca 
attgggaaac 
cttcgcttta 
tgtcattaac 
gttcagaatt 
ctcgttggtc 
actaagtcat 
tctggcactg 
ccctttggtg 
tgcctgtaag 
atggggacca 
gaaaatattg 
attgtcattt 
atggaaatgc 
gaatagcaat 
gccctggcaa 
ttatttttat 
tggcgcgatc 
agcctcccaa 
gtatttttag 
ttgtgattta 
ctggccgatc 
cctgatttta 
cagagcaaaa 
tcatctttgt 
atatccagta 
ggcacagttg 
nnnnnnnnnn 
nnnnnnnnnn 
ggagttcaag 
tatctgggca 
attgcttgaa 
cctaggtgac 
agaaaaagaa 
aaccttattg 
gaggaagaat 
aagcttgaaa 
aagtagataa 
tctgcaatca 
gagcagagaa 
ctgagccatg 
ttttttgaga 
tcacagcagc 
tgggattaca 
gtttcactgt 
tgcctcccaa 
acattttaat 
cccacaagga 
ggcaaattca 
agcttggtca 
ttgcattaaa 
gtgtgtctga 
gactggtggg 
tgctggagag 
ccttgaggtt 
ctccccagag 
gaggggctac 
ccaggacagc 
gaggaacctt 
agatgagaaa 
ttgagcccag 
gtgacagagc 



cacctgcttt 
tgtatgtttt 
tgttacttcc 
actgctttcc 
tttgtctata 
atggacacag 
tggattttta 
aaactcataa 
tagcatttca 
atgtgtgctt 
cagtggaaaa 
aagctggcta 
ttgtactcac 
gcacaaagta 
gattatcaaa 
cagctgaggc 
gtctgggaat 
aaggggatgg 
ttttattttt 
tcggctcact 
gtagctggga 
tagagatggg 
ccctccttgg 
ttctaacttt 
aactggcacc 
catgggtggg 
atttcttctt 
agtgggtgta 
ctcatgtctg 
nnnnnnnnnn 
nntagtccca 
accagcctgg 
tgattgcagg 
cctgggaggc 
ggagcaagat 
acaatgaatg 
gtcatcccag 
aatggcaaat 
tccagtgacc 
gtggagggtt 
gactcagaaa 
gaggaaggaa 
gagggctggg 
cagagtctcg 
atctgcctcc 
ggtgcccacc 
gttgggcagg 
agtgctggaa 
tgacaagggt 
tgcctctgag 
aggcttgaga 
gttaaaattt 
atgttattta 
gttgagagcc 
agatgggagg 
actctgtatg 
gggcaaggtg 
gacatttgga 
tgacatctgg 
ccctccacaa 
gatctagatc 
gcttgtgcct 
gaattcaggg 
aagattttgt 



ggcttcccaa 
taatacacat 
tgaacccatc 
ctggcttcca 
aatgttgggg 
ctctaagagg 
tccaactact 
aaggctgatc 
cgggaatgga 
gggaaactta 
ctgttgactt 
actgaattgc 
cctttagggt 
ggtgtaaatt 
gacttaggag 
caggcatttg 
taaaagagag 
cctccactca 
tgagacggag 
gcaagctccg 
ctacgggcgc 
gtttcaccgt 
cctcccaaag 
ttaaagagaa 
caattataat 
ggcaactgct 
gctttacgca 
gaatgaataa 
taatctcagc 
nnnnnnnnnn 
gcactttggg 
ccaatatagt 
tgcctctaat 
ggaggttgta 
tctgtctcaa 
agtgtgaggc 
ccacgaagtg 
gggggtctat 
aggtttttag 
tggtccgttg 
tggcctctca 
tttaggtctt 
gtggaatatg 
ctccgtcgcc 
cgggttcaag 
accacgcccg 
ctggtctcaa 
ttacaggcat 
caccctccac 
ttcaccctga 
catgctggcc 
tgatgttttg 
tcttgaatag 
tcccctgcct 
tgaggagggg 
gagcttttca 
gcatatccta 
aacatgtgga 
tgagtagaga 
taaagagtta 
cacagcacag 
gtaatccctg 
ttacagtgaa 
ctcttaaaaa 



agtgctggga 
tcaaataacg 
ttgagtgctc 
agcccagctt 
ttgggggttg 
tcttggcatc 
cagagtgatc 
gataaaacag 
ttctccgtta 
ctgagctaac 
tggagattat 
tttgcttctt 
tattgcatgg 
tgcacgctag 
caggaacagt 
ctaataattg 
gtgaggacgt 
gctacagtca 
tcttgctctg 
cctcccgggt 
ccaccaccat 
gttagccagg 
tgctgggatt 
gcaagacatc 
ttacaacact 
ggtcaccggt 
ttgtcccagc 
accaatgcag 
acnnnnnnnn 
nnnnnnnnnn 
aggccaaggt 
gaaaccccgt 
cccagctact 
gtgagccgag 
aaaaaaaaaa 
tcatggtagt 
aaatggtacc 
gcctccaccc 
gtcctggacc 
ggctacggag 
gttacttctc 
ctcaccctct 
gggaataaat 
caggctggag 
ttattcttcc 
gctaattttt 
atacctgacc 
gagccaccgt 
tcatgtgcca 
caagtccact 
tctctttaaa 
ttttgcatta 
taaatttctt 
gattctagtc 
agtgttggtt 
tgctgcctgt 
gatcagagat 
gacatttttg 
ccagagggac 
tttgacctca 
aagaaaggat 
tgactcagga 
ctatcatcgc 
aaaaaaagat 



12480 
12540 
12600 
12660 
12720 
12780 
12840 
12900 
12960 
13020 
13080 
13140 
13200 
13260 
13320 
13380 
13440 
13500 
13560 
13620 
13680 
13740 
13800 
13860 
13920 
13980 
14040 
14100 
14160 
14220 
14280 
14340 
14400 
14460 
14520 
14580 
14640 
14700 
14760 
14820 
14880 
14940 
15000 
15060 
15120 
15180 
15240 
15300 
15360 
15420 
15480 
15540 
15600 
15660 
15720 
15780 
15840 
15900 
15960 
16020 
16080 
16140 
16200 
16260 
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gaggacaggc acagtggctc atgcctgtaa 
gatcacgagg tcaggagttc aagaccagcc 
aaaaatacaa aaaattagcc agctacttgg 
gggaggtgga gcttgcagtg agccaaaatc 
gcaagactcc gtctcaaaaa gaaaaaaaaa 
agagagagag gaaagaaaga gagaaggttt 
cttccacttc ccatggaacc ttggcaagtt 
tgcctctgtg aagtaggggt ggtacctggt 
gtgttgcaca ctatgcccct aggatgggtg 
cctcttgttt cttcccccca ttgtctccca 
tgggattcca ccagcaggag gctactcctg 
ttttaggggg cataaataat aattatgcct 
gaagcaaacc agggtgtggg gtacccaaac 
gcagtcccag cagagagcag ggcacagggt 
tcctttgtcc cccactctgt cctggagcaa 
acccagtata aaggctgctt ctggcccctg 
gactcttgga aggtgagttc ttttctggct 
agaagcagca cctcataggg caaacacgta 
agcagacctg tccccgcagg ggcaggagtc 
cggagactgg aagacaaatg aaatttggta 
tagactctcc agctcagggt ctgttcatgg 
taatcaaaca attaaacaaa tattccctct 
acaaagacag ccccaagaag gacggtggag 
caatgcctct tgctcctgcc ctttctcctg 
aagtctctcc cttccctcct catgcgtctt 
ccccccgctt ttatttgcac tcatgaggcc 
cagctcatct ccggtaagat atcacttgga 
tttgtttgat ttttttttgt tgttgttgtt 
tgttttgaga cggagtctcg ctctgttgcc 
tcaccacaaa ctccgcctcc cgggttcaag 
tgggactaca ggcacatgcc accacgcctg 
gtttcaccat atttgccagg ctggtctcaa 
cctcccaaag tgctgggatt acaggcgtga 
ctttcagaag gaaaaaccac ccacctgtta 
ctcacacgtg ctttctccca ccttcaaaac 
ttcttgtcca tttgtgggtg tgacccgaga 
ctccagtctt gtcttcaggt ggaattccta 
ccacggtgct ggaactagga agctcttgac 
aatgatgccc cccatctgga gagcctagag 
agttcaaagg agcaggagag agacttggct 
gaggaggtca aggcttctgc ctgtcaagac 
gacccagctg ccttagacaa ggacttccag 
cagggaagtc caaggtgcaa gatctgccgc 
gcagaagctc aggtaagtag tagggaggct 
cattcagtct tctgttccct attcatttag 
agaatcacct ggagttgttg ttaaaacaca 
atttaggagg gctgaggtga agcctgagaa 
atgatattgt tggtctgggg agaaccaccg 
gcctgtgtta aattctggtt ctgcctttta 
cctttcaaag ctttagcacc ttcatttgta 
agttgtggtg agcattaaat gagataatac 
gtaataaact ctcaataagc ggtggttata 
attgtcctga cagctaatta agaagcaaaa 
ttttaacctg aaaagggatg ccgggctctt 
agagggaatt cnnnnnnnnn nnnnnnnnnn 
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 
gttagcacat agccagaaca tctagaaggg 
tggaggcaat ggcacttgca aaggctttgt 
tcaggaaagg agagcttcag cttcagtgtg 
gatgaggctt ggaggctggg cagattttgc 
agtttggatt ttctcctact aaggacaaca 
tgacaatgac acgtttgcgt tttggtggct 
cagaagatgg aagataaagc actaaagggt 
ggggacgatg tgagtgacag taaatcattt 



tcccagcatt ttgggaggcc gaagtgggtg 16320 

tggccagcat agtgaaaccc catctctact 16380 

gaggctgagg caggagaagc gcttgaaccc 16440 

ttgccattgc actccagcct gggcgacaga 16500 

aaagatgaga aagaggaagg gagagaaaaa 16560 

tggagtcaaa aagacttaga aattccagtt 16620 

gccttctctc tttctctgaa tctcacattt 16680 

ggagatgatg cggagatgag ggtgaggggt 16740 

agagcttggg agcactgaac ctccctttcc 16800 

ccagctccct gggatctcca cttcactctc 16860 

gagttaaggc gtgttgttca gactggggca 16920 

ggacaatgga cataacatct agggccttct 16980 

aaggcagtag gccccaggag gcaggtccct 17040 

tgagaagact gagcaaactt cattatcagc 17100 

tcattctggc ctcttcccac ttccccaaaa 17160 

aagccagagg cactgagagt ggaggtctca 17220 

gcccaggcag gaccagtgta ggccctggga 17280 

ggaggcctgt ccttaggaac atcatagcta 17340 

tgggctaagg gtgatactgg agagcagcaa 174 00 

cctgagttat ccctcccacc attccttttc 174 60 

caagaggaga aagcaatctt gtttgctctt 17520 

atactatgtg ccaggggcta tactagacac 17580 

tagtgtcctc gctaaaagac agtagatatg 17640 

ctgggaacag tttctgctct tcatctgggt 17700 

tccctttttt cctttttcct acactcccct 17760 

aggaccacag ccttccctct ttagctgata 17820 

ctcagaactg • taacctggaa ctttctcttt 17880 

tttgtttttt tttttgtttg ttttttgttt 17940 

caggctggag' tgcagtggcg cgatctcggc 18000 

caattcttct gcctcagcct cctgagtagc 18060 

gctaatcttt gtatttttag tagagatggg 18120 

actcctaacc ttgtgattcg cccgccccgg 18180 

gccaccgcac ccggcaaact gtaacctgaa 18240 

agatgaaggg ctggtgactg ccccaggctt 18300 

acacactcgt ggtgtcggcc agaagtcagg 18360 

gatctctcct tacctaacac caaggaaatc 18420 

ggaaagctcg agcgacgttg ctggagctgt 18480 

ctgatggcag gttacctctt cttcccagag 18540 

acacaggcag acctaggcca ggatctggat 18600 

ctgacggagg aggtgattca ggcagaggga 18660 

aactttgagg atgaggaagc catggagtcg 18720 

tgccccaggg aagaagacat tgttgaagtg 18780 

tacctattgg tgcggactcc taaaactttt 18840 

actgcggagg acctggggga aaagagagta 18900 

gctagtggtt ctcaaagcct cgcatgcatc 18960 

gctttctggg cctcacctgc acgacttctg 19020 

tttgcattta caacaaatcc ccaggtgatg 19080 

atttaaacaa aaggctttgg tgttagaaac 19140 

ttagctgtgt tacctgggca agttgctttg 19200 

aaacgaagat atatagcacc aacttcttag 19260 

atgaaaagtg tttggaatag tcactgggct 19320 

attattatga gtattatcat ttcctgtagg 19380 

gataggatta agggaggcaa gtaggtttat 19440 

gcctggagac tcagaaactt gaaataaatg 19500 

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 19560 

nnnnnnnnnn nnnnnnnnnn ngaattctct 19620 

gtggtaggag tggggattag aggttccagc 19680 

tgaagtggcg taagtgtgga ggtggagcat 19740 

gctggagtgc tgggtgtgaa gagaggtgaa 19800 

tccaaaagag cttggtgaac tgtgataagg 19860 

gcaaactatt gaagagttta aatcgttcag 19920 

cactcgagct gccagccagg tagacagtgg 19980 

gatgaggcag gaagccagtg aggagagaaa 20040 

gttgggttgc tattgtgtgc taagctctgt 20100 
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gctaaattct 
caatcgttcc 
gcagcatgca 
ctgctctagg 
tcaacaatat 
gatccctatg 
aaaagggtat 
cagcactttg 
ggccaacatg 
cacgcctgta 
gcagaggtta 
accctgtctc 
taaactgcta 
aaagaattag 
taatcccagc 
cagcctgacc 
gtggtgcatg 
cgggaggcag 
agcaagaccc 
ctgcattcct 
acctcccatc 
tgtgtgcctt 
aaatgtcttg 
ggtattcatg 
tctaataaac 
gtgtaaagtc 
ctctctctca 
gcaagcatcc 
ggcagcctct 
tccttcccca 
agagaccctg 
tacatgtagt 
atacatatac 
cctacatcaa 
taggtaaaga 
gtgggaggac 
tctcaactaa 
gctcaggagg 
acggtcacaa 
aaaaaaaaaa 
tggaacttct 
gtttaaagct 
nnnnnnnnnn 
nnnnnnnnnn 
tgaactcctg 
tgtgcaccac 
tatatataaa 
ttttagagat 
tcctcctacc 
gagttatttt 
gagcctggag 
agtcacatag 
gtggaaggtc 
caagagcctc 
gcggatcatc 
ccaggagcat 
ctggctgagc 
aaggtggcag 
caacccgctt 
gtgggatgta 
gaaggggtga 
ccagttgggt 
cccctcattg 
ggcactgtcc 



tcacgtgtat 
cagctgaaga 
agttgtccag 
actgggggaa 
tgttggcctg 
agcaagtggg 
aagaaataac 
ggaggccgag 
gtaaaacccc 
atcccagcta 
cagtgaactg 
aaaaaaagaa 
gtcctcaggt 
gatgtcagct 
actttgggag 
aacatggcga 
cctttagttc 
gggttacagt 
tgcctcgaaa 
gctcggcaac 
actgatgtct 
ctgctctggg 
tggtaggcaa 
cctgtgtata 
agaacacagc 
tgtcttgttt 
ctttcttact 
atgttgtgag 
acccagcaag 
cttgagcttt 
agacagaaca 
tttaagctac 
catgtacata 
gaatgtgtca 
ggccaggcat 
tgcttgagcc 
aaaaaaaaaa 
ctgaggtggg 
aactgcactc 
aaaaggaaag 
attctagagg 
ccttcagcat 
nnnnnnnnnn 
nnnnattttt 
ggctcaagca 
catgcctggc 
tatatatata 
agggtcttgc 
tctgcctttc 
taccacacga 
gcaagtgaat 
ttagtgtcac 
tggggctcag 
tgctccccca 
ttctacatgg 
ggtgaggcac 
tcccctcacc 
agacaggtag 
ctcagggggc 
agagtctggg 
tagtgtccac 
tctactcctc 
gctacatcat 
tcggagatgc 



tatttcagct 
agctgaggct 
ccaggattct 
atggtcccta 
ctacatgtga 
atgggggtgg 
aattaggccg 
gagggtggat 
atctctacta 
cttgggaggc 
agattgcacc 
aacaaaacaa 
gcatttttgc 
tcttttgaaa 
gctgaggtgg 
aaccccgtat 
cagctactca 
gaaatgagat 
aaaagaaaaa 
agcctgatgg 
tccctcgctg 
tgcccagatg 
cctcaaagat 
atcctctccc 
aagggtaatg 
gtttccctct 
gtcttgtcct 
ctgacctatg 
gaactgagtc 
cagatgacgg 
ctcagctaag 
tatgttttgg 
actgtttcag 
ggcaccattc 
agggctcaca 
caggagttga 
aattagttgg 
aagatcgctt 
tagcctgagc 
aaagaaaaaa 
gggatggttt 
ctggggaaaa 
nnnnnnnnnn 
tagagatagg 
atcctcctgc 
ttatttcata 
atttctgtat 
tatgttgacc 
aaagtgttgg 
agcataggac 
ctgaactttc 
tccccccaag 
gggcgccccc 
ctttcctgtg 
ctgctgtgaa 
cgctgaggcc 
atacgtttcc 
ggctatgaaa 
gggactcctc 
ttcagaggaa 
tgagtgccag 
cgtcttcggg 
ggactggcgg 
caggtgacct 



aatccatcta 

ctcaaaagct 

aacttagaca 

gtgagatatg 

agagctgtgg 

agtggacagt 

gccaggggtg 

cacttgaggc 

aaaatgcaaa 

tgaggcacga 

actctactcc 

gtaggtactt 

ttatcagttt 

atttgggcca 

gtagatcacc 

ctactaaaaa 

ggatgctgag 

tgcaccactg 

gaaaatttgg 

tggatgggca 

tcatcatctg 

aattggatgc 

cactggggcc 

ttgagtgtgt 

ggatgctact 

cttgatcttc 

tccctttgtt* 

aagaggccca 

actcatcata 

cagccctggc 

ctatacccta 

gataatttgt 

ttgtctgaga 

caggtacttg 

tctataatcc 

gaccagcctg 

gcacagtggc 

gagcccagga 

aacagaaaaa 

ataggtaaag 

ttcacagtag 

ggttnnnnnn 

nnnnnnnnnn 

gtcttgctat 

ctcagcctcc 

tatatatatt 

ataaataaat 

accaggtctt 

gattacaggc 

atacatccaa 

ccgtctttga 

aactgcatga 

tgctgtctcc 

gag'cctcctc 

caagatgctg 

cctgggggtt 

ctacccacag 

gcagggccct 

taggcctggg 

ggctattttg 

ttcctgaccc 

gccatgcagc 

atcaaggact 

gcctgtacag 



acaactctgt 
agtaacttgc 
ccagaggcca 
tgtcgagttt 
aaagcgccca 
aggagggctg 
gtggctcacg 
caggagtttg 
aattagctgg 
gaatcacttg 
agcctgggag 
tctgccatag 
taaaaatcag 
gaagcggtgg 
cgaggtcagg 
tacaaaaatt 
gcatgagaat 
cactctagcc 
aagatctgac 
gaggctcagt 
cttgacatgt 
tatatgagaa 
tccaatgatc 
actacacctg 
tctaaggtta 
ctctcattct 
tactctgatg 
tgtggtggta 
tgggtgagct 
tgatgctttg 
tctcctgaga 
tactcagcaa 
ctatatttag 
gaatacatca 
cagcactttg 
ggtaaaatag 
acatgcctgt 
gtttgaagct 
gaccctgtct 
atccttgatt 
aagtctgtgt 
nnnnnnnnnn 
nnnnnnnnnn 
gttgcccacc 
tgagtagctg 
tttatatata 
aaatatatat 
gaactcctgg 
gtgagccatg 
aaatgttctg 
agaaaccagt 
gctgggacaa 
ccagggtctg 
accatgggca 
gagtaccttg 
gggggcacag 
agacaaatga 
ggctcacgcc 
cccacccagg 
gtgctctctg 
cactgccctt 
tgttgtgcct 
gcgtggacgc 
ggatggtgac 



aaggcaggta 


20160 


ctaagttcat 


20220 


cttttaacca 


20280 


catatttcat 


20340 


aagtgagtta 


20400 


gaacacacat 


20460 


cttttaatcc 


20520 


agaccagccc 


20580 




20640 


aacccaocraa 


20700 


a cao aatcrtci 


20760 


rrrra rrcrat tea 

\_j y CA \J y U w\ 


20820 


agaatgtctc 


20880 


ctcacgcctg 


20940 


c* y ^wyyayii*-' 


21000 


aacfccrcractcr 
a y y*—- ^y y y ^ 


21060 


cacttgaacc 


21120 


taacraaacaa 
u yyy a y HVH y 


21180 


aacacrtfccrac 


21240 


tgtctgccaa 


21300 


aacrcatttaci 
«yy^»»-^^yy 


21360 


aacattctat 


21420 


cctccttcct 


21480 


data ct tact 


21540 


a a t*t a caaaa 


21600 




21660 


aaci caacicta 


21720 


aacfaactaaa 


21780 


tacraaacaaa 


21840 


caggcttgtg 


21900 


tagagtataa 


21960 


Lay u uaauta 


22020 


tea ttttaca 


22080 


attaacagaa 


22140 


y y cty y uuuay 


22200 


fnana ra ft* rr 


22260 


aataccaact 


22320 


ccagtgagcc 


22380 


caattaaaaa 


22440 


ptfccr ccctct 


22500 


tgacageget 


22560 


nnnnnnnnnn 


22620 


nnnnnnnnnn 


22680 




22740 


cicra a t a ca o a 


22800 


tatatattta 


22860 


atatatatat 


22920 


fret caaotna 

y v»" ^ v-* c4 y v y 


22980 


nfarrfaart" 

y OaLU LQQUl. 


23040 


aactcfacrr*aa 

uyi^i^ycay\_.ciu 


23100 




23160 


L-C-cly cl y y y L»d 


23220 




23280 




23340 


uyaL^uyyuyy 


23400 


y cy gg ucauu 


23460 


acagcaacaa 


23520 


caccccactg 


23580 


taaccctttt 


23640 


gcctccgctg 


23700 


cccatcctgc 


23760 


tctcacctgc 


23820 


cccaactcag 


23880 


agcaagtggt 


23940 
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caggcagtgc 
agaaccctag 
agggacctgg 
caccttgcct 
ttgcagttga 
cagctttcag 
gggctagaat 
aggcttggca 
atcagggaga 
gctgtcagcc 
ttcaattacc 
gagcaagagg 
ataccatcac 
ggagactgga 
ctggagacca 
cccaggctgg 
agcgattctc 
gattctgtga 
aggatgggag 
agcctggccc 
ctagcagggc 
ctctctcctg 
tcaggccaag 
ctgaggactt 
cttggttctc 
tgcagatttg 
catgttaaga 
gaaacatgga 
tgttttctgc 
cagggagtcc 
ggttgctacc 
cagtgccttc 
caacttacac 
tccaaaggga 
ctggatctat 
tgggaattga 
ctctcctaac 
cattgttcga 
ttgggctgaa 
gggagggccc 
tcacttcctt 
tagaggaagc 
ggcccccaga 
ttgccttata 
cccctgctgg 
acaataatga 
aggatccttg 
gctctctttc 
tcacactcca 
tctgaccctt 
cctggattat 
tatttttaac 
tggtgtcctg 
ccctgcaagc 
ctctgccaca 
gcatctttgc 
gcccaatctt 
gcctcccttg 
tttttctcag 
ctgctccatg 
tatctccagc 
acagaataca 
ggggctcagg 
gctcagtggc 



ttttcatttt 
gagatcaggg 
cttgcttaat 
tgtttagtcc 
ggaaaccgag 
tggcagcctc 
aggtagagtt 
ttttaaaaca 
gactcagccg 
agggctgcag 
cccatggcct 
gctaagcgac 
ttctgttacg 
caaagcaggg 
ccattgaggg 
agtgcagtgg 
ctgcctcagc 
ctgcttttgc 
aaagagactg 
tggcactgag 
ccttctgtcc 
ggctggcctc 
ctgctgtccc 
ctgatgagag 
aaaccttcaa 
cagatatccc 
aagcacccgc 
gctagagaac 
tgccctgaca 
tgggaacgta 
aaatccatca 
accctgacca 
ctccaggtac 
tggtccatcc 
tttccgggtt 
ggcccagttg 
ctcttcactc 
ggtttcttcc 
atgccttcct 
caggcatccc 
aaaggcagcc 
tgaaatacct 
atggcctccc 
attttcctct 
gcgctgctga 
tgacaaaatg 
accatcactc 
cttcatctgc 
tgttcactcc 
cttgtcacct 
tgtggcagcc 
acaggggctg 
tctcatctgt 
tctggcccca 
gccacagtgc 
atgtactagt 
accttctctc 
gctgtcccat 
cgtttatcac 
ggaatgccac 
acctagagca 
gtggacagtc 
gagtttgagc 
cctgaagttg 



ctctgtgcgt 
gtacctcccc 
aataagaata 
tacaacacct 
gtacagggtt 
cagaacctgt 
ggtatattag 
acaaacactt 
gctgattctg 
tcatctgaag 
ctccataggg 
agagaccccg 
ttctattata 
gaataccagg 
tttttttttt 
cacgatctca 
ctcccgagta 
tctttttgtg 
ggagagaagg 
ccttcctcct 
aggacacatg 
agtgcccttg 
caggagggca 
acggtgtgag 
gtgtgcatga 
cgctgagatt 
tttgatgcgt 
gtgggtttct 
tcgaaggcac 
gaggcagcag 
gaccacgcta 
acctgctgct 
ccaccttcat 
agcacctgcc 
ctccaggaag 
gcctcttcgc 
tctctttttg 
actcagcctg 
gagctttgca 
ctaatgcacc 
tatcctgctg 
ggcttgatga 
ctccatgccc 
tcatgttgga 
gggactctcc 
acaaccatta 
cctgagttca 
tcaagccaga 
atcctcagta 
ccactgctgc 
tttagtctcc 
cagtcacctg 
ctcagagtaa 
gcacttcact 
ttcctagtgc 
ccctgtttct 
cttcttcagg 
gtacaattgc 
taacactctg 
ctcctcgatg 
gtgcctggca 
ctgcgaggtc 
cacttgccaa 
gagcatttgc 



ttacatccag 
attttacaga 
gctaatgcag 
ctttgaggta 
gcacagttag 
gttataaggg 
atatttattg 
ctcatctcat 
agtcacagtc 
ggctgactgg 
ctgctcagga 
tatcttctct 
ggcacagagc 
aggcaggatc 
tttttttatt 
gctcactgca 
gctgggattc 
ttcatccaaa 
aaatccagtg 
ctaccctctg 
ggccgcctgt 
attgtgccgg 
tctgcttatg 
agcttcccac 
gaatcactta 
ctgagggccg 
gtgtcattgg 
atgggtttcc 
atctgaaggg 
ccagtgactt 
ctgcaagatc 
tgtgggtttt 
ccttcccctc 
ttccaggaag 
tgtttctagt 
cctacccctc 
gttttagttt 
tgggagtctc 
accgtgatca 
tctctctgag 
taattgactc 
cactttggtt 
accctgaaga 
cagtccccac 
cctgtgtatg 
tcaagtgctc 
gcctcactgc 
accctggagt 
cagccagcag 
ctctccagtc 
cacatctgcc 
tcaggacata 
aagccaaagg 
cccctctagc 
tccggaagtc 
agaacattct 
tctttccctg 
aacctcccta 
cctatctctt 
gcaggaatct 
catagtaggt 
aatgccatcc 
tatcacacat 
cctcaaggct 



cagcttgttg 
tgaggaaact 
agtgctgact 
gatgcgttaa 
gtcattcacc 
tacacgctaa 
ctgtataaca 
acagtttctg 
tctcatgaag 
ggttggagaa 
cacagcacct 
cacataatct 
aaccctgata 
cttgagggct 
gagacagtct 
acctctgcct 
accatggagt 
cagtccctgt 
gcctccctcc 
ctcctaatgg 
cctcacccca 
agagaggaag 
tctagggcag 
ttcccacctt 
gtgggggata 
agatgaggcc 
gtaggggagc 
catagaaaca 
ggaggggcca 
cccgtactcc 
caaaagctca 
ggcatcacct 
tccctgcctc 
cgcagttctg 
agattgggtt 
cttcctccag 
gtgacctttg 
tatgctgcag 
gagaacccca 
accctctgat 
cccctgttgg 
ctatgtctgc 
aatttcccaa 
ttgcatctct 
tgatggagta 
cgttggtgca 
agcggtctcg 
ttccttgatg 
cagcttctac 
ctagccacca 
ctggccccgc 
agtctcttca 
ctttactatg 
tccccctcct 
tcaagtgtgt 
tctccagata 
actgtcctct 
ctgcccgctt 
gcttattgtc 
gttgacttgc 
tctcagctaa 
ctgttattag 
acaggaggtg 
ggggaccaaa 



ctttctccca 
gaggctagga 
gtgcacttgg 
tatcttcatt 
caagatcaca 
agtcttgtta 
aatcacccca 
acagtcagaa 
acatagtcag 
tctatgtcag 
gctttccctt 
cagacgtagc 
tactgtggaa 
gtcttgttgg 
tgctctgtcg 
cccaggttca 
cttgaaccca 
ttatcctaag 
ctgctagcag 
tgagggtccc 
gcctactgac 
cgctggacag 
ggacaccttc 
ccttcccatc 
tttgtccaaa 
tgtgaatctg 
aacactttga 
tggatttctg 
ggccaagaac 
tcagggacgg 
ccaatgccat 
gtctcatcaa 
ccgaggctcc 
gtcttctgat 
ggcgaggggg 
cctccacaca 
tcctgcacac 
tgtgagtctg 
gggaagggtt 
ggcagggagc 
agtcttccct 
tgtttgaaac 
gggcagccat 
ctcctggttt 
acaggacatt 
ggcagcaggc 
gcagagggca 
tttctctccc 
acaccccaaa 
acatctctag 
tgtctcagtc 
catcactctg 
gcctaaaaag 
ccattgttca 
tccctgcttg 
tctgcaaggt 
tctcagtgag 
ctctgcttgg 
tgaccgccac 
ttgatcgtgg 
atgtttgttg 
tggaggaagt 
tgagaaccca 
gagcccatgc 
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aaagagcccg aacgcttaag caccaccctg cctggccagc ggggnnnnnn nnnnnnnnnn 27840 

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 27900 

nnnnnnnnnn nnnnnnnnnn nnnncccact gcgcctggcc cattactttt aatggcaaaa 27960 

accacaatta cttttgcacc cacataaata gttaccatgg gctgagcatg gtggctcagg 28020 

cctgcaatcc cagcactttg ggaggctgag ccaggcggat cacttgaggc caggagttca 28080 

agaccagcct ggccaacatg gtgaaacccc gtctccacta aaaaatacaa aaattagctg 28140 

ggtgtggtgg cgcgtgcctg taatcccagc tattcaggag gcagaggttg cagttcactg 28200 

aaatcatgcc actgcactcc agcctgggcg acagaatgag actctgtctc aaaaataaat 28260 

aaataaataa ataaatattt accatgtttt gaccacctgt tatgtgccaa ctgtattact 28320 

taaaaacacc catgggaggc tgggcacagt ggctcacgcc tgtaatcgga cactttggaa 28380 

gggcaagcgg ggaggatccc ttaaggccag gagttcaaaa ccagcctagg taacacagta 28440 

agccctgtct ctacaaaaaa taaaaaaatt aactgggcat ggtggtgtgt gcctgtaacc 28500 

ccagctcctc gggaggcaga gggagaggtt cgcttgagcc cagcagtttt aggttgcagt 28560 

gagccaggac caagacacta cactccagcc tgagtgacag agcaagacac tgcctctaaa 28620 

caaacaaaca aacaaaagcg acctgtgggt aggtaggaac aggctcatag tacagatgag 28680 

aaagcagagc ttggagggct caagcgattt gccaagcaga ggtccaagcc gaggtctctc 2874 0 

tgaatccaaa gttaattccg tctatcatat caccacagcc ctctctgccc cagggagagt 28800 

ctctgcccac tccagccact cacgtgtaat tgacttcctc aggggcagga aaggcttcga 28860 

tgggccagtt gagggtgcag ttcagaaaga taaggcaggc caggccagac caggtgaaca 28920 

tgatgaccac gaaggccaca ccggcatcgt agatcagctg tgagaggagg gggcaggccc 28980 

gtgggggaga ctgcctggcc ccagacccca ccaaggtaga tcccaggcct cagaggcctt 29040 

aaagaagttc tcttctcccc ttgtccttgt gcccaatttg cagatgagga aaccaagacc 29100 

agaagtttag agtcagactc agaagaccca tcattccttt ttctttttca cttgaggccc 29160 

cctagagagc tatgaaatag tctccacaaa gcctgaagtt gctggccact ggctcaaaat 29220 

atctctgaaa tttccattat cttaaaaaaa tacatacatt tttgcctatg actccacaaa 29280 

cattcatgtt catgttcgca caaaaatgtc catttcatag tacgtacaaa ggaaacttag 29340 

tgctctaggt ttaccgggcc taatcgtgtt tatcctgccc- cttcctggca cattccccag 29400 

gggaaaaggc aaacccagac tgctcatgct cagccttttc tcacctttcc caggtcctcc 29460 

cacgtgcaac aactgggggg gttggggaga gggaggtgca' agtgctctgc ccaagggctc 29520 

tcaaccccag ggcaggtaag ttctcaattg aatgagattc tgtgcaaatg tgtcagccct 29580 

tcttatggaa gaagctgatg caccatctgt cctcttgtcc tccccatacc atctgaccag 29640 

gataattaat gtctgctctc ccctcaggct cctgctcaaa cctttttctc tgcagtcttg 29700 

gaccttggtg ccttttcctc cctaggggca ggacagagct tcaaagggcc acacccccaa 297 60 

atgtgtggag gtaagatctg gctcttcaaa cactacttca gttgaaaaga agggagaact 29820 

gcccaccctc catgcctgcc caccagaaca actgatggcc cccccaccca tgcgctctct 29880 

caaactcctt tggagacact gagcaaaagt accttcttta gtactctttg taaagtgcaa 29940 

aacggtatgc agtttggtac tgcccaccgt ggaggttgag gagcatggca tggctcaaag 30000 

ggtcctttga tatttgacag aggaaattga ggcccccatc ttgcactgag ctaaaacttt 30060 

ggtcccctgg cttcgaggta caccaggttg acctgtccag gatccagcct ggcataaact 30120 

cactttgtga ccttggacca aaccacccat cctctctgga aggtgtggaa aaatgtggcc 30180 

ccaaaggctg aataaagcca gagagtcagg gaccttgaac gcatgtgaag gggctggact 30240 

tgattctgta ggtgaagcta aaccactgaa ggtttttcag cagtgtgtga gccagttccc 30300 

catctgagat ctttctggaa gtcacgtgag tgacagagta cagagaaaaa gaatcagagg 30360 

cagggagacc agctgagaaa gcttgctgtg gcccaggaga gagggggaag gcctgcattg 30420 

ggatgatgac agagaaagga gagcggagaa gtcagacccg tgggtcagca ctagctgctg 30480 

ctcactcggc cccacccggt tcttgtgtca agacaaaaag aaaacccagg tggcctcata 30540 

ccttgattcc tgggaacgta atggcagaag aggcgtaaga gccaatcatg agggccatta 30600 

acgtggagcg caggttccca aacatgttgg gcagctgagg agggaaagca gcacccatga 30660 

ggtggggaca ccgtgaccct tgcccagcat tcccagccct gctccataca atagctccag 30720 

gagacgcagc agaaaagccc caaggtaaaa caaacagaaa aatcaatgtg ggaaactgta 30780 

ctctgccccc tgcctacaca gtcacagtgc cctttagctt caaaaaggct cccagacacc 30840 

cctcagagag acattttgtt aattttgttt aattccaggt ttcccaagtt tgttacgtaa 30900 

cacctctgaa aaacacatgg aataggtgct taagaaacac tgatcttggc tgggcgcagt 30960 

ggctcatgcc tgtaatccca gcactttggg aagccgaagc tggtgggaag cttgaggtca 31020 

ggagttcaag accagcctgg acaacatggt gaaaccccat ctccaccaaa aatacaaaaa 31080 

ttagctaggc atggtggcat gcgcctgtaa tcccacctac tccagaggct gaggcaggag 31140 

aatcgcttga acctgggagg tggaggttgc agtgagccga gatcgcacca ctgcacttta 31200 

gcctgggtga cagagcgaga ctatgtcccc accccccaaa aaaaaagaaa agaaaagaaa 31260 

gaaacagtga tcttgtccaa cccatttgag atgagacaat tgagacccag ggaggaaaag 31320 

tgtactcaag ttcacagagc acattaatgg ctttctcccc attgtcgttg tcccagccct 31380 

aacccaaggc tgtgaccatg gctgtgtccc ggtaataggc agtgcctctt aaccctctcg 31440 

gttgacgtcc cagcccagtt tctgcctaat caggacaaat cacatcctgg gaggtgaggg 31500 

tggaaataag ggagggaact gagccagggc agacagtctc cagaggaggt ggctctgacg 31560 

cagagcaggg tcagaaccca caccaggaga gaatttaatt gatcatgtgt tccactcacc 31620 



34 



WO 02/29059 



PCT/US01/31488 



tgcctcagcc aagccctcag ggcaggggaa ggcaaagtca ggatgccctt cgcacacacc 31680 

ctcctctggc cccaccatcc tccccaagtc actagatccc acagctgaga aggaccttag 31740 

gatccgtaca aagcctaaac acactccaca gagggggaaa ctgagactct gaagggaggc 31800 

ctcaacagct ctggtaaaaa aggcgtttag gccgggcgca gtggctcaca cctgtaatcc 31860 

cagcactttg ggaggccgag gcgggtggat tgcctgagct caggagttcg cgaccagcct 31920 

gagcaacacg gtgaaacccc gtctccacta aaatacgaaa aaattagccg ggcgtggagg 3198 0 

cgtgcacctg tagtcccagc tactcgggag gctgaggcag gagaattgct tgaacctagg 3204 0 

aggcagaggt tgcagtgagc cgagatcgcg ccactgcact ccagcctggg cgacactgcg 32100 

agactccgtc tcaacaacaa aaaaaaaaaa atggtgttta aacacatata actaaattat 32160 

ccttccccct tcccctgaag tggctggctc aggaaaaacc tctacccact caggcagagg 32220 

ttttcctgca ccctgcatcc gtgaggcacc actgccaagg acgccaggga aggctgccag 32280 

gcctggagag gggcagggcc ccctcccctc caaggggcca caaacgctgt ctgcgcccag 3234 0 

taccgtgggt aaggcgaggc cggccggcta accccgggct ggcggccttg cagcgtgcgt 32400 

ggcaacagca gctgggcccg caagactcag cacgggacgt cctcgtccaa gtctgggcca 324 60 

agagcagcgg cccagggggc ggggccggcc agagggagcg gggagaggct gaggggcggt 32520 

gccagcgccg gaccctgcca ttggctggag attacaggag gcggggacat agcagggagg 32580 

agccgctgga caagccccac ccggccgcca gggagggtct gaggtcaaga gccggagaga 32640 

agggatttag ggccctgggc caagttgcac agcagggaga aggggctgcg cagaggggcg 32700 

gggagaaagg gatccgcttc cttcctttag agctgtgaaa tgtccccggt tggaattaaa 32760 

ggcggctgct ggggagaggt gaaattcagc caaaaccacc cagtcaggca gcccttctca 32820 

gagataaaca gtccgagcca gcccggccag gaaccttccc ctccaacctc cctaagcctt 32880 

taacactcct aagcctttaa cgcgtttaca cactcacata aataaacaca ctttgagcaa 32940 

cacacataca ccactcacca catgtaatag gtcaagccat gtgcacgacg aggtgtcgac 33000 

aatttcatat ggttcaacct agtacactca caaacacacc taccaactca tggctttcac 33060 

agggacgggg tcacacaccc actctcccac gacatggcaa gcgtgcacac gctatctcaa 33120 

gctgctccct ccccctcaag atcatgttac ccagttttat tttcttccca gcacctatga 33180 

cgactgacat aatttattag tttacttgtt tattgggtta • tctgtgcccc tcacccccaa 33240 

aatgtaacct ccagcaggga ggatgactcg gtcagtcctg attgtgctgt agtccaggac 33300 

ctagaacaga gctccatgga cattcatggg ctctgtacac acaaacacac acattaacat 33360 

acaccccgac acacagcctc atccacacac acacagcctc acacctgctc tttgcagcca 33420 

cctgcacagt ttctcacaca ctcacttgat ctagtgatct gcgtccacag gcccctcccc 33480 

cagcccactc atactgccct caccccactc actctgccct caccccactc gggggaactc 33540 

tgctgccagg ccaggcctgt gacactcacc gtgagtgaag tgaacgttag gcagatgcca 33600 

ccaaagccat tcagggacag cgccaggaat atcaacggag acagagctgg aaaggggaaa 33660 

gcagcagatg agggcatttg gggagctgtg ggaagccaag ggcgggagct ggggtaaaca 33720 

tccgccttca tcccacctat tcttttcttg tggggccaca agaggacaga caactcacct 33780 

tccacgtccc gggaggccag ggccatgagg gtgcaggacg cagtgaagca ggcactgtgg 33840 

agacacaggg aagggcgagg ggttggcctg tgagcacccc ccctcccctc cccctgcagc 33900 

acggtccctg tcctcccgtt ccccatagcc cagccacctc acctgccaac cagccgcacg 33960 

ggtcgggggc caaagcggtc catgaggatc cccagtggca gggtggtggc gctgagcacg 34020 

aaggaaccaa tggtgaagcc caggttgagc atctcgtcct gctggtcaca gcctggccac 34080 

ctgcgctgct catcctgggt ggtgttggtg ctgctctcag ctgaggaggg ggaagggagg 34140 

gctcagcaca tgacaccagg aacagctggg cacaggagac agcagcccac agtcaggcgg 34200 

cctgctttca aatcccatgc caagtgcctt tgggggtacc ctagagtcac atctcctctg 34260 

atggggctgc tccagaaatg gcagccatta gtacctgacc ctgggagagt cttgtgcaca 34320 

cacagcctga ggcttcaact agctcaaatg aaatactgga cataaaagta tttactaagt 34380 

tgtaatatgc actcagtgtc caagcttagg gggttgtgga cccccaacaa gaagtgcccc 34440 

catatctaga ggcaaaggca aaggcagtga gtggtactct aatggctata acaagaattc 34500 

attaaaatgg cccggcgtgg tagttcatgc ctgtaatccc accactgtgg gaggctgaga 34560 

caggcagatc gcttaagcct acaagtttga gaccagcctg ggcaacatgg taaaacccca 34620 

tctctaaata aaaaaaagaa atttagaaag aacactaaaa cttagaggaa gctttcccga 34680 

taaatgatag tctgataaaa taatagctaa tacttattga gcacttaact atgctccagg 34740 

cactgttata agtcagttaa taaagtatcc cgttccctag gtgatgaagc tgaggcacag 34800 

aatgagaacc aggcactgcc ctccagtccc ctctagaagt ccacttggag gacttgtcct 34860 

taacggtaaa ctgccaactt ggagttgtga caagttaagg agaaaagcta gtgataggag 34920 

acaaagggct gcttcgcttt actcaatgct cannnnnnnn nnnnnnnnnn nnnnnnnnnn 34980 

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 35040 

nnnnnnnnnn nntccctccc tccctccctc cctcccttcc ttccttcctt ccttccttcc 35100 

ttccttcctt ccctccatct tctccacacc tggtatcatc atacagaagc agagaggact 35160 

gcacttggtg aaagtttcaa ttctcctgtg tggagaggtg agcactgagg aaggggtggg 35220 

ggctgtcaaa ggagacttac ccaatctttc cagcccacca atcccttgcc cagtgtttct 35280 

ataaaataag ggccttttgc atctgattta agtaggaagc tgattcctga gcccctcaga 35340 

tctgctgaat cagatagcta agggggccct ggaatctgca ttttagcaag cggaggtggg 35400 

ttatgaagca ctccaaagtt tgagacacgc ctcaaaggtg gagtggttct gtggggggca 35460 
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gaaaggaaaa 
tggaaagggt 
caagccagac 
cgaagcagac 
cgtggagacc 
ctcatccccg 
gtccatcact 
ttctacccca 
cctttgctat 
ctcgcccctg 
gaccccgcca 
ggattgcgcg 
cgcatatatg 
aggcttcgaa 
cgcaccctct 
ccagctcgct 
cagacgccct 
ggccagaggc 
ggctgcagat 
cgggcagagc 
accctcggag 
cgcacgtggc 
actattcgca 
agccagccga 
aaaaagtcag 
ctttgccaag 
tctctcttat 
aatgctgtct 
actcttggcc 
gctatccttt 
cccttgggtg 
caacaccaac 
atcctaaacc 
tgtcccagag 
ctcttgtcct 
ttatttattt 
aaggaagctc 
actttcggca 
tctaaagcat 
aaatcctggg 
gttgggaaca 
ctagttaatg 
caccaaccgc 
taagacttgc 
gacccccaac 
agggctgtgt 
gcacatggga 
accgctagaa 
aacccctcta 
gaaggttttt 
taaatctacg 
cttttggttg 
gcggtcacct 
tgctggagcc 
aatagcagga 
agtcatctgc 
agccagaata 
ttaatgttca 
agggcctggt 
cactgtagga 
tctgaccttt 
agacacagtt 
gacacgcggg 
gcaaataatg 



tgcaaagggg 
gatgccaggt 
cggggaggag 
acttgcaggg 
ctggagagcc 
ggcaccagcg 
cccccggcca 
cagtccccag 
ttgcgatccc 
ggagggcggc 
ccccgtcccg 
gcagggtctg 
cccccctttc 
attcctcgtt 
cccctcccca 
tccgcccctg 
tctcctagga 
aagcttcgag 
aacgcttgca 
aagccccggg 
ccagccgcag 
gcttaccccc 
aattgaggat 
acccgctggt 
tgttggaagt 
tctccagacc 
taacttctca 
tacagagcca 
ccaggagcag 
ccacagactg 
tgtgccttaa 
ctcttaggct 
aatgttagtt 
gcacaagcag 
cactgatggt 
actttaaagc 
aaaatttgca 
ggccctaggg 
tctgtcctaa 
aagtaggggg 
gggaggtgag 
cccagctgag 
caaagacagg 
aagcagcagc 
ccctaaccca 
ctctctcaca 
cctcaggggt 
ggagataaga 
ggatagggac 
tttcctcctt 
gccatggctc 
ctcaggcctt 
cagagccgct 
gggcacggta 
ggcccagatt 
ccttagcctc 
aatacagcta 
gcaattctgc 
gggtgcccac 
gtctgggctg 
ggagatgtta 
caaggaactg 
ctctgactcg 
gcaattctta 



gaaggggtca 
gtggggagta 
ggggccacag 
tgcaccccgc 
cccaaccttg 
gccttcccgg 
gagcccacca 
tcctagctca 
agaactcgtt 
cccattaacc 
cggcgggggt 
ccacagggca 
tgggaaaaga 
ccctatcctc 
gccatctgtt 
cccagccccc 
ggccggaatt 
accccccacc 
aggacgggag 
aagaggcagg 
ccacgcagcg 
acccccgggt 
cccggacaca 
gggtgctagc 
cgggagtctg 
cctgaggaca 
agagaaacaa 
acctttggag 
ctgagaaccg 
ccgaggttcc 
cctctctggg 
atagtttgga 
ttcccttcat 
gtgcagggct 
gtcttctctt 
ctcactttaa 
taaagtttta 
atgctatggt 
tcctctgtat 
caggagctgg 
ccagacagcc 
cctgggtgaa 
cgcacaccag 
accagagagg 
ggacacagct 
ttcacacata 
agcctgtttg 
ggcaccctgg 
tgtcttcagt 
tttgcagtct 
tatgtgcatg 
gattgcctgt 
aagcaccttc 
atagaagagg 
cacctttagg 
ccacagggag 
gtacttatta 
aaagtggctg 
accatatggc 
gtcaggctgc 
accaatggga 
ggatgcacag 
gttcacatcc 
ctgagtgcct 



cacttgggga 
acagatagag 
ccaaggtgag 
cctctcttcg 
tttctggggg 
gaggctcaac 
acgctcctcg 
tctgcataaa 
ccccaccccg 
ctcgcgaccc 
ctgggggtga 
gaggccaggg 
cggggagggg 
cggcccccgc 
ccactccgca 
tccccaagcc 
tctgcctcca 
aaccaccacc 
tcggggaggg 
ggttttccct 
ccgcctgccg 
ccgctcctgg 
gagtgcagag 
caattctgat 
ggctcagagc 
ggttttccta 
agacaaaggg- 
gtgggggaga 
gaaagaagct 
aaattgagct 
cttgtttcct 
taaaatgaga 
ttggggactt 
ggataaataa 
gatatagata 
tgttaaaggt 
agataaaata 
gggaagtttg 
ggagaaaagc 
actccctcca 
agaggcgaac 
gaaggatggg 
ccagtctctc 
gaacctgccc 
ggcacctcag 
cacagacaca 
ccgatccccc 
tctcctccaa 
caatggagcg 
ttacaaaaat 
ttacaggtag 
catccaggtc 
agtgggccca 
taggaaggca 
gcaaggagag 
gagaaggcgg 
tgtgtagtca 
agatgagact 
actcactagg 
tcccgaaatg 
tcccgttcag 
cctggtggac 
cactctgcat 
ccttctcagg 



aggtttcaga 
gaggcaaagt 
acaggtcagc 
tggcaatctg 
gtgggtcaga 
acgcagatac 
aggtccgacc 
gctccaatta 
agcccgtttc 
gggccgctcc 
ggggcgcgcc 
ctctccggga 
ggcttctcct 
acccctcctc 
gcgccgcgac 
ccggggagtg 
tctcccaccg 
accgttgcga 
tgtagggcga 
cccgggtcgc 
ggcacaccaa 
ctcgcgctca 
accccggcaa 
tttgtacttt 
agcagggatc 
tctgaaaacg 
agggaaaatg 
tggccaaggc 
tgggacctcc 
ccaccaccta 
acagcgacaa 
tagctgtgta 
gctctaacct 
ggtatgtctt 
attttaaagc 
aaatgtaaat 
ggagactcca 
agtcatacct 
cagcttcctg 
agcactaagg 
gggctggcat 
ggtgtgggga 
acttcccttt 
tcctggccct 
gcccctttcc 
tgcatgtgtg 
caagaggtac 
cccaaggagg 
ttgacttagg 
agaacttctc 
aaaagccata 
ccttggtctg 
tcccattggc 
acaggaatcc 
agaaacagag 
ccatttttct 
ttgttccacc 
tctcaggtat 
taggtatgag 
gggccttctg 
ggtggcgaga 
agaaggcttg 
tactcactgt 
gctgttgtgg 



caataccgag 
gagtggagac 
agccagaaac 
agaccgagga 
gaggaagcct 
ctggataggc 
ttgtccctcc 
acatgttttt 
ccgccgcttc 
tggcggtcct 
ctggggcaga 
aaaaggcagg 
gggagactcc 
ctccccgcca 
aaacacggct 
ggggagtgag 
gggtccggct 
gggccggtga 
gtttaaagga 
cgccccccgc 
ggacctggcg 
gcctccccag 
gcctactgaa 
acaaaaacaa 
tgcgatgtga 
gaggggacag 
gcttagctgg 
ctctgaggtc 
tttctgcaga 
acactgtgtg 
gaaagaatga 
gaacagacag 
ccagggctta 
tctgcaggat 
ttcacgttat 
atagtataac 
aaaaagtgtt 
tagcattctt 
gatgtacccc 
gcagggcatg 
gccaagcgtc 
agacaccccc 
ttatttcctc 
ggaaggggcc 
ttctgaaagg 
cacactcatg 
caggaggcag 
aagaaagctc 
gggcgttttt 
ttggtattta 
tggggcactc 
agaagtctat 
ggcgtactcc 
caggagtgag 
tcaagtaggt 
ccaggtcctg 
agtatctcac 
aacaagtggc 
gaaggcacag 
ggctcacccc 
ggaggctctc 
gaaggcccag 
gtgactttgg 
cgaagatgta 



35520 
35580 
35640 
35700 
35760 
35820 
35880 
35940 
36000 
36060 
36120 
36180 
36240 
36300 
36360 
36420 
36480 
36540 
36600 
36660 
36720 
36780 
36840 
36900 
36960 
37020 
37080 
37140 
37200 
37260 
37320 
37380 
37440 
37500 
37560 
37620 
37680 
37740 
37800 
37860 
37920 
37980 
38040 
38100 
38160 
38220 
38280 
38340 
38400 
38460 
38520 
38580 
38640 
38700 
38760 
38820 
38880 
38940 
39000 
39060 
39120 
39180 
39240 
39300 
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agttaaaaaa aagtatgcat catgcttagc acatagtgag tgcttggtaa atagaagcag 39360 

ttatttcatc acaattcttt gggaggaggg tttacgtgtg ggtggcccca cagggcagat 39420 

gaaagatcag cgtcagggag gcagatgagt tcaatgtaag gaaaagactt actaacagca 394 80 

gcagggctgc ctcgtgcagg agtgggtgcc ctaccactga gggtatctaa gctaagaggg 39540 

aagggtcccc tttcaggggt gctggagaca ggatcccaca ctaggtagaa ctggattgga 39600 

ccaatggtgc ctgaacacag gcccaagagt caggactggc cacttcacaa agcacctgga 39660 

gtttactaaa aacagactcc taggaggtca ggcactgtgg ctcacgcctg taaccccagc 39720 

actctgggag gccaaggtga gaagatcatt tgaggccagg agtttaagac tagcctgtgc 39780 

aacatggcaa gaccctgttt atctgtacaa aatttttttt taaaaaatta gccaggtatg 39840 

gtagccatca cctgtggttg cagctactca gaaggctggg gccggaggat cgcttgagcc 39900 

caggaatcag aggctgcagt gagctgtgat tttaccaccg cactccagac tgggcaacag 39960 

aacaagacac cttctctaca aaaaaaaaaa aacaataggg ccgggcgcgg tggctaaggc 40020 

atgtaatccc agcactttgg gaggctgagg agggcagatc acgaggtcgg gagatcgagg 40080 

ccatcctggc tagcacggtg aaaccccgtc tctactaaaa atccaaaaaa aaaaaaaaaa 40140 
ttagctgggc gtggtggtgg gcgcctgtgg tcccagctac ttgagaggct gaggcaggag . 40200 

aatggcatga acccgggagg cggagcttgc agtgagccga gatcgcacca ctgcactcca 40260 

gcctgggcaa cagaatgaga ctccgtctca aaaaataaaa ataaaaataa ataaataaat 40320 

aaaataacaa taaattaaaa acaaaaacag actcctacgg tcaggctgag atatcctgat 40380 

tcaggggact ggggaatctg tatttttaac actccgtgag gggttctaaa aggcagacaa 40440 

cttggaaacc tgcagattag agacctctga ggtgcctctg gctgagatga gtgagggatg 40500 

gcaccacata caaggcccta cccctgcccc caggagagtg gctcctgctc cccccacacc 40560 

aaccctcgct ctcacccaga agggctctcc tttcaggggt cccaccatcc ccatgaaaag 40620 

tggctgctga agcaaggcga acacagcact ggtgagggac tgcaggcctg tcagcgtccc 40680 

aaaaggggtt ggatgggaac ctgtccccaa aacgggagat caaagggtgg tgggggcctt 40740 

tcagcccagg caagaacttt ttcttttcct tcccaacatg ggnnnnnnnn nnnnnnnnnn 40800 

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 40860 

nnnnnnnnnn nnnnnnnnnn nncactccag cttgggtgac- agagtgaaac cctgtctcaa 40920 

aagaaaaaaa aatcttaaag aataaggata taaagaaaga aaatattttt gtgtagctgt 40980 

tcaatgtttg tatttcaagc caagtgttat tacaaaacag tcaaaagttt ttaaaaattt 41040 

aaaagtttat aaagtaaaaa agctaagtaa gctagggtta atttttttat cgaacaaaga 41100 

aaaatatctt tgtataaact tagtgtagtc taagtgtaca ttgtttttat tttatttatt 41160 

ttttattttt ttgaaatgga gtttcactct tgttgcccag gctggagtgc aatggcatga 41220 

tcttggctca cggcaagctc tgtctcctgg gttcaagcga ttctcctgcc tcagcctccc 41280 

aagtagctgg gattataggc acccgccacc atgcatggct agtttctttg catttttttt 41340 

ttgaaatgga attttgctct ttgacccagg ctggagtgca atggtgcaat ctgggctaaa 41400 

tgcaacctcc acctcccagg ttcaagagat tctcctgcct cagcctcctg agtagctggg 414 60 

attacaggca tgcaccacca cactcggcta atttttgtat ttttagtaga gacagggttc 41520 

tcaactaaag agaaccatgt tggccaggct ggtctagaat tcctgacctc aggtgatcca 41580 

cccacctcgg cctcccaaag tgctgggatt gcaggcatga gccaccatgc ccagccagta 41640 

tacagtgttt ataaagcctc cagtagtgta cagcaatgtc ctagaccttc acattcactt 41700 

actactcact cactcactca cccagagcaa ctgccagtcc tgcaagctgc atgcatgata 417 60 

agtgccctat ataggtgaac cattttttaa tattttatac tatattttta ctgcaccttt 41820 

tctatgatta gctacacaaa tgcttaccat tgtgttacaa ctgcctacag taatcagtac 41880 

agtactatgt atgggtttgt agcctaggct ataccatgtt gcctacgtgt gtagtcgtct 41940 

atactgtcta gtttgtacac tctatcatgt ttgcataaag ataaaatcac ctaatgacac 42000 

atttctctga gtgtattcct gttgttaagc aacacatgta taaacattta caagaaatag 42060 

ctcaaatttt tttttctttt gatacagggt cttgctttgt cacccaggct ggagtgcagt 42120 

ggcgcaatct cggcgcactg cgacatctac ctccccggtt caatcgattc tccggcctta 42180 

gcctcctgag tagttaggac tacaggcacg caccaccacg cctggctaat ttttttgtat 42240 

ttttattaag agatggggtt ttgccatgtt ggctaggctg gtctcgaact cctgacctca 42300 

ggtgatctgc ccgccttggc ctcccaacat gctgggatta caggcatgag ccaccatgcc 42360 

cagccattac gtttttttgg ttgtttaatt tttttttttt taagagacag attctcactc 42420 

tgtcatcaag gctggagtgc aatggcacaa ccatagctca ctgcagcctc caactcctgg 42480 

gctcaaggga ccctcctgcc tcagccttcc cagtaactga gactacaggt gtgagccacc 42540 

atgctcagct aattattttt tatcttttat tttttgtaga gggggggtct ttctatgttg 42600 

ctcaggtttg tctcaaactc ctgggctcaa tcaattctcc tgctttggcc tcccaaaggg 42660 

ctgggattac aggtgtgagc ctgaaaacct tctagtgtgg aagtggaaga taggcccagg 42720 

ccacttatgt tttcaagtta agcaaggttt aggtcactta tgaagcctga ctagttttgt 42780 

ttgcttaagg gatctgcagg cctgacctcg gttttcattt gttttaacag tgtctatgtg 42840 

tatgtgtgtg tttatgtacg tgcatgatgg ggggaaagct cagaaatcaa gtaagccaaa 42900 

cacaaacatg taattataag cagggataaa ttctatgatg aagaagtatg ggccacggga 42960 

gagtacttgt gccagtctgg tgatcaggaa caatgtcctt tgggaagtga catttgagcc 43020 

atgccctgaa gtacggtagg agttggttag gggtgaggca gtaagaccca gagctggggc 43080 

ttcctgcaca agctcagctg ggcactgagg acccagtgga ctctgctaca gggcagtgag 43140 
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gagcagaaag gctgaggaag gctgggtgtg gtggctcaca cttgtaatcc cagggctttg 43200 

agaggctgat gggggaaaat cggtagagct caggagtttg agaccagcct gagcaacata 43260 

gcaagactcc atccctgtaa aaagctttta aaaattagct gggtgtggtg gtatgcatct 43320 

gcagtctcag ctactcaaga ggctggggta aggattgctt gagcctagga ggtggacgct 43380 

gcagtgcgcc acgattgtgc cactgtactc caacctagga gacaaagcga gatcctgtct 43440 

caaaactgaa tgaataggct gtgtgcggtg gctcactcct gtaatcccag cacttttgga 43500 

ggctgaggtg ggtggatcac ctgtgattgg gagtttgaga ccagcctggc caatatggtg 43560 

aaacccgata caaaaattaa ctgggcatgg tggctcacat ctgtaattcc agctactcgg 43620 

gaggctgagg catgagaatg tcttgaaccc ggggggcaga gggtgcagtg agctgagatc 43680 

gcaccactgc actccagcct gggagacagc gagactccat ctcaaaaaaa aaataataat 43740 

aataacaatt aaaaaaaaat taaaaggcca gggagcactg gcagcctgtc caaggtttca 43800 

ggtcacttta gtaaagggag aacaatggct cctcccagga cctctgggat ctcagcattg 43860 

atacgacagt catggaaatg ctagggccca ggcagaccat ctcagggaaa acaagtggct 43920 

ctgccctgcc ttggccactt cctggccctc tgcatgcccc agggtctcag caccaagctg 43980 

ttctcagtga gtagctctca tttagtgcca gggctctcgg gcttacatcc tacgatgacg 44040 

atggaatgca taaaagatgg ggctgtgata gcccagagct aggggtttga atctcatgag 44100 

atgttcatgg agccctggga gggagctcag tgcaagttca tttctctttt ttggttgaga 44160 

tggggctcag aggaggaagg acttgttcaa agacacacag ggagtgtttc agtgtgggac 44220 

ggaggtttat ggagaaaggg tgaccatcca aggcttggac aaagatcatg acttcgacca 44280 

gcaagcctca actctgtaga cttggtgggg gccaggccct cccaaacaca cctgacaggt 44340 

gtctgtggtc ttggggacat tgtcgctccc cttcctgctg atgctctgct gtccctctcc 44400 

catgaagcgt atctcttcgc cgtcccccat ccttgctgag agaggatggg ttctcttctg 44460 

accaatactg aagatcttta gtaaagttct cttttttttc attttctgaa agtccctctc 44520 

ttgagaaatc aggacaagtg agtcagggcc aggacaaaaa acagtgtggg acgagtgtgg 44580 

tggctcacgc ctgtaatccc agcactttgg gaggccaagg tggcggatca cttgaggtca 44640 

tgagtttgag actagcctgg ccaacatggt gaaacctcgt ctctacaaaa tacaaaaatt 44700 

agccaggcgt ggtggtgcat gcctgtaatc ccagctattc- gggaggctga ggcaggagaa 44760 

tcacatgaac ccaggaggcg gaggttgcag cgagctgaaa ttgggccact gcactctggc 44820 

ctcttggcaa cagagccaga ctacctctca aaacaaaaac aaaaacaaac gacaaacagt 44880 

gtagactttg tgtttttctc aaaagcactg tcaagccagt gcccgcagca gtgggcctag 44940 

acacctccag tcttgcctca gggtcagttt ccagcctccc tggacacttc ccccaggtat 45000 

gtgtactttt tgattgtcct aaatccagag tctgtggcct gacctggttt gtcacagctc 45060 

tcagtccctc cccatcccga atcccaggga gccgcaggtg tgtgcagaag aggcacacca 45120 

cactcaatac atcttgcatc ctcgctggac ccaatccatt ggcttggtga tgtacagact 45180 

gagcctcatt atagccgttc gttcctgttg acctttccag atcaatctgc cagcttggct 45240 

tctccgagtt tcgcttgtca gcatttctcc aatcccatca tgtactttgg acctctttgt 45300 

tgggtggctt gctttatctg aaattttcag atttgacttc aggtctctcc tttgtccctt 45360 

aatatggctt aatggtggac cctgtcaggg gtagagaaaa tattgaggag ccctgacttt 45420 

gaggtgcaca agttagaggg ttagacaagt ccagccacaa ccagcccaag ctgcagtgta 45480 

gggaggcctg tccagctgct ccacggttga gggtggagca tacaggaagg cttccttctt 4 j|^jj 

gctgcagccc aggtgttctg gctgccctag ctgcctggct ttggtagaag aaagaaaggc 45600 

tctgtctctg acttgtcaac taatggcact atgagattgc acataattaa cctgggtctg 45660 

ctcttccaaa agccttgggc ctctgactgc aacatggagt ctgggtatca ctccccatcc 4 ^20 

ctgcgccact cacctgctct ggcgctaggc gtgtgcctaa tcacttaatt tctctgtgct 45780 

gcctcttagg tatcacttcc cctgatccca aatacttacc aggtgtggga tgacacctga 45840 

ctagttactc cttggaggta tctgcttctc accggggact ccgaaaccaa acgaaaagca 45900 

aggccaagcc cagcctaaag gacgcttcct acatgacttc aggcttgcgg gggctggagc 45960 

gtgggggtgg caatggagtt ggggggggct cagggagggg atgtggaagt gctttgcttt 46020 

gcaaactcta gagaaccgtg taaataggag tgattattct gtcccttccc tttctttcca 46080 

acaggaatca gcatcccaca gcccatgttc agctatgaag aatggaaact gaggctccgg 46140 

gaggggtata gggaggagcc agcagggtct tgagttcata ttagtgccct ttcctccata 46200 

ggcacatctg tgttttcttt tattttattt tgaatttaat tttttttttt tttggcagag 4 6260 

tcttgctctg tcgcccaggc tggagtgcag tggcgcggtc tcagttcact gcaatctccg 46320 

cctcctgggt tcaagtgatt ctcctgcctc agcctcccga gtagctggga ttacaggtgt 46380 

acaccaccac acccagctga tttttgcaat tttagtagag acagggtttc acagtgttgg 4644 ° 

ccaggcttgt cttgaaatcc tgacctcaag tgatctgcta gcctcggcct cccaaagtgc 465 ^ 

tggtattata ggtgtgagcc actgcgctcg gccacatctg tgttttaaat gagaggaaag 46560 

gggataatgt gcattttgtg gaagcttggg ccgtttgtgt ctaggactct tatgatcttc 46620 

ataagttttc ccccagggag gacactgttc cacttaggga gtcaggaccc ccagtcctta 46680 

caagattcag cctctcaaaa tggagacagc agttccaggc ctgggctggg ttctgttcac 46740 

actaggagag ggcaagtgag tggtgtttgg gatgtgggga agtattatga aaacagagat 46800 

gctccaattc ctagtgatag gaaaccatta agctacttgg catcttaaaa ccaagagcgg 46850 

ttcaagttct gagattgtta acacacctta caacaccgcc gccgttatta ggaagaagct 46920 

ctgtttgatg acgtcccaca ctgtgggtac ctttatgaac aggaatttgc tttttcaaat 4 6980 
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cccagagaag 
gaattcaaga 
gttcagagct 
ggtagagctc 
tggctcatgc 
gagtttgaga 
agccaggcat 
ttgcttgaac 
ctgggcaaca 
acatactgtt 
cagctatata 
cctcctcagc 
tctaactttt 
aggcatgatg 
ttgagcccag 
aaaataaatt 
gagaggagaa 
gcactgcagc 
cgcggtggct 
gtcaggagat 
aaaaaagaaa 
ttgggaggct 
gatcgcacca 
ggaaaaagaa 
tatcacatct 
atgcagtatt 
catttgggct 
cactacttct 
cctgatcata 
cttttttgtg 
ggtggaaggg 
cacctggttg 
tatgtggtta 
gacaacaaaa 
nnnnnnnnnn 
nnnnnnnnnn 
ggacatttgg 
actagtatct 
ggcctctgta 
agtaggcgct 
ctggtgtgat 
agaggtggag 
gatccaggtg 
tggctggatg 
aggggtgaag 
ctgagaaagt 
gtgtgtgttg 
ggagaagcat 
ggagatgaag 
atgacataac 
atccccaata 
aatggccaca 
atcactactg 
ggggtagtca 
acaagactgc 
ccacttgctg 
taacctccaa 
atggttggcc 
gtctgttgag 
actttaagtg 
acttctgagt 
ttatttatca 
ataaatgaat 
gaggctcaga 



taagattaaa 
atgactgacc 
ctcccaatag 
aggaatctaa 
ctgtaatccc 
ccagcctggc 
ggtggcagac 
ctgggaggca 
gagggagact 
acaggcacag 
actaccacct 
agaggtaaca 
ctatcaatta 
actcatgcct 
gagttggaga 
agctggtcat 
tctcttgagc 
ctaggtgaca 
caagcctgta 
ggagaccatc 
gaaaaaaaaa 
gaggcaagag 
ttgcactcca 
aaaatatata 
gcgaaattaa 
ccgtcgtgtg 
gttaccaggt 
ggtgagcgta 
aggtaggtgt 
tatttctggt 
aagcaggagt 
gcatggagct 
gctccatgca 
gctgatgtgg 
nnnnnnnnnn 
nnnnnnnnnc 
taatgtctgc 
ttttggtaga 
acaaaaaagt 
aaaacataag 
tagctctggg 
gagggtggta 
aaaaacggtc 
agatggtgaa 
gatgaccaga 
gaccacaaaa 
agttcagtct 
cttgggtgac 
acttgggaat 
ccgtggtggg 
tttaaccatc 
aaggctactg 
cagaggtcaa 
gtgggcacct 
tatggtaaga 
gaggagcttg 
gaaacacaaa 
cctagaaatg 
cccaggctaa 
atcctgacaa 
atctcctcaa 
gctaccttaa 
gaatcaatcg 
aaatacctgt 



gttggctgtt 
atacagaatg 
tcacccctga 
aatccatttt 
agcactttgg 
cacatggtga 
acctgtagtc 
gaggttgcag 
ctgtctcaaa 
accttaagtg 
atatcaagac 
gacccacacc 
gttttgccca 
gtaatctcag 
ctactctgga 
ggtggtgcgt 
ctgggaggtt 
gagtgagatt 
atcccagcac 
ctggctaaca 
ttagccaggc 
aatggtgtga 
gcctgggcga 
tacattgtgt 
tctacactgt 
actacgggac 
tctggctgtt 
tgcactcatt 
gttggctttg 
tagagcggaa 
cggtttctgg 
ggcttctcct 
atgaacccgg 
atttaaaggc 
nnnnnnnnnn 
cagcagtggt 
agacattttt 
ggctagagag 
atccagtcaa 
gagactgtgc 
ttttagaaag 
agactggagg 
agcaggtgac 
gaaagcacta 
gtcctgcctt 
agtgaagcag 
gagatgtgtt 
catatgtgtg 
gatctgcgta 
tgtgcagaat 
tggaagaata 
agaagggaag 
gtaggataag 
gggcaaaagc 
ggaggaagag 
gctttggtgc 
atcatccata 
ccatcccact 
gcgctacctc 
cactgaaaat 
tatattgcct 
acctccctgc 
atgatccaga 
agaatcgaaa 



ctccatcctt 

gggagcaaac 

actgcacccg 

aaaattaaag 

gaggccgagg 

aaccccgtct 

ccagctattc 

taagccaaga 

aaaaaaaaaa 

tacagcccaa 

acattccagg 

tctcctgctc 

ttcttgagct 

cactttggga 

caacatagtg 

gcctgtagtc 

gaggctgaag 

ctgcctcaaa 

tttgggaggc 

cggtgaaacc 

atggtggcgg 

acccgggagg 

cagagtaaga 

actttttggc 

gtgtatgaaa 

aatttgctta 

atgaataaag- 

tcgcttatgt 

taatgtgctg 

catgagggtg 

ctcacacatg 

ttggcgttgc 

gcttctgcaa 

ttcagttcan 

nnnnnnnnnn 

tctcaactga 

tgattgtcac 

gctgctaaac 

aaatgtccac 

ctgagagcaa 

ctcattttgg 

cagggaaagt 

taggaaagtg 

taactaacta 

gcaggtctag 

gtttgtgcgt 

ggactcacaa 

agtctgcagc 

tatatttggt 

taggagagac 

agaggagcct 

cagttcttaa 

aactgaagaa 

agttttggtg 

ggtgttgagg 

aaagcagaga 

tcctggctca 

tctcctctgc 

ctcaagcaag 

gtgtgtctct 

tgttttacta 

aactagagat 

gcctggtaga 

taaatgcatg 



gaaaaatttg 
ttgggaagaa 
gaccatcagt 
tatatcgggg 
tgggaggatc 
ctactaacaa 
ggaaggctga 
ttgtgccact 
aaaaaattaa 
tgaaatttta 
aactcagact 
cggtggtaat 
tcacacagat 
ggccgagacg 
agacccccga 
ttagctattt 
tgagccgtga 
aaagaaaaaa 
caaggcgggc 
ctgtctctac 
gctcttgtag 
cagagcttgc 
ctctgtctca 
atctggttta 
ggttggttct 
tccgtattcc 
ttgctatgga 
aaatatcttg 
acttggttat 
tctcttcagg 
ttgtgactga 
ctactgttgg 
aatacattaa 
nnnnnnnnnn 
nnnnnnnnnn 
gcatagtttt 
agcccagccg 
atctaacaat 
agtgttgaga 
gaaggagtaa 
ctgcttgtag 
aatttgggag 
gcagaggcaa 
atgtgtggat 
ttggaaggtg 
gtgtgtgtgt 
tgtccatggg 
tcagaaacag 
agcttgagcc 
gtgcaccaag 
gccaacagaa 
gaagggggaa 
tgtctgttgg 
gagcaatagg 
aagtggccag 
agccagctca 
aattccagca 
ttatcctatc 
ccttctctgc 
tccattcatg 
atatgctcgt 
tctctttaag 
ggcttgtgtc 
tgtgctctga 



gttttagggt 
agaaggcaca 
tatctctgtg 
ctgggcgcgg 
acgaggtcag 
tacaaaaatt 
gtcagaagaa 
gcactccagc 
agtatgtcat 
cacatctata 
ccatcatacc 
taaccactat 
atacattgtc 
ggagtatcac 
ctctacaaaa 
gagacgctga 
ttgcaccact 
tatggccggg 
ggatcacgag 
taaaaataca 
tcccagttac 
agtgagccga 
aaaaaaaaaa 
ttttgctcaa 
ttttgttgtg 
tatcggtggg 
tattcttgta 
ggtggaatta 
gctgaattcc 
gaatctggag 
actgctggta 
ggcaggtgtg 
caacgacaga 
nnnnnnnnnn 
nnnnnnnnnn 
gcctcagagg 
agaaggtact 
gcacaggaca 
ggtttaggta 
ttggaaagtg 
acagtgcatc 
ccactgaaat 
tggggatggg 
gatgggcagg 
atggtttctc 
gtgtgtgtgt 
acatccaagt 
gcctggggct 
acaagagtag 
aagccaggtg 
attgggaggg 
gtgaagaggt 
gtttggcaat 
gataacagaa 
cgagtctaca 
ctcattgact 
ctaccaggag 
ctatctgtca 
ctgccgtcac 
ttagttctac 
tctgtttgcc 
tatttgttga 
catggtggat 
tctaaactca 



47040 
47100 
47160 
47220 
47280 
47340 
47400 
47460 
47520 
47580 
47640 
47700 
47760 
47820 
47880 
47940 
48000 
48060 
48120 
48180 
48240 
48300 
48360 
48420 
48480 
48540 
48600 
48660 
48720 
48780 
48840 
48900 
48960 
49020 
49080 
49140 
49200 
49260 
49320 
49380 
49440 
49500 
49560 
49620 
49680 
49740 
49800 
49860 
49920 
49980 
50040 
50100 
50160 
50220 
50280 
50340 
50400 
50460 
50520 
50580 
50640 
50700 
50760 
50820 
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gctaaacttt 
tatgtaatgg 
acaaggtcct 
cagtaagaaa 
tgtgtaacaa 
tcggagaagg 
agtggagctg 
gacaagagcc 
actatgtggg 
aaatatggtc 
ggaaggtaga 
actgagtgtt 
ttcctgagga 
ggggccttcc 
aatggggaac 
gccattcacc 
gttggccaag 
gtcacttttc 
ctggcttcct 
tttctcagtg 
cccgttcctg 
agggagatgg 
tttagcagct 
cccagcatgg 
ggctgtgtgc 
tctgttccct 
tgctctcagg 
tgaagccagt 
cttcaggaag 
ggcagttcct 
aatccatcat 
tgcctgtcaa 
gttcttccag 
tcaaagaatc 
taattgcagg 
ctgcagggtg 
ctggagctgt 
aggggcatcc 
tggtgcacac 
cagtaccagg 
cctaaagttg 
ctgtcaggtc 
ctccaagctc 
cagtgatgag 
gggtatcgtt 
ttgaaattca 
agagcctggg 
caaaagtaga 
ggtagcattc 
gtcagggctc 
caggactgaa 
tggagccaaa 
gaccttgtgt 
agtgcctctg 
gaggccttct 
tgcttcttct 
tattaggaac 
tggaatccaa 
acatgtgaat 
acttcataaa 
gcagctatta 
agccaccctc 
gctaagctgt 
tgagggtccc 



ctccaggggg 
aaaaactcct 
tgttgtcatg 
gcaagattat 
aagtaccaaa 
cttatctgag 
ggaccaggat 
tgatcatgaa 
aagactttga 
gcaggggcta 
agatggggca 
catggtctct 
agcgggaaga 
agatctcaca 
tgttgtccag 
cttgcagtgg 
cccagttaac 
tgagggtggg 
atggtctgag 
tccatcttac 
tgggaccctt 
gccttgaggg 
tcaaacaact 
ctaggttttc 
tcatctggag 
ggagttgcag 
tcctcgaggc 
gatggagaat 
agagccctgt 
tttccttaaa 
atttacagtc 
aagggccaag 
aaaaaaatca 
tcatggaggg 
gttcatggtc 
tggggaggga 
ggtggttttt 
caagagctcc 
atgtttttct 
tctgaccctg 
ggatgagccc 
ccctttggaa 
tgtagtaact 
tcaacatttc 
ctcttgctca 
agtcctgagc 
caagtgggcc 
catcagtctc 
atttactaag 
ttaatggcag 
aggaaacgga 
gtcaggctgc 
ctggcaccca 
tgggtactgc 
tggcctccat 
ctttgataga 
ttttgtcttc 
gaaaccacag 
tatctcattc 
tgatgatatg 
agtgctagac 
tcagacactg 
ggatttcaga 
tgaacagtca 



taaagttcaa 
tctatgacct 
aagtttgctt 
atcattttgg 
taggagagtg 
gaggtggcct 
gacaagggtc 
agactttgca 
atgggaagca 
gtgaggtggc 
ggggaggcaa 
gggagcagag 
acccatctga 
tgccaatcgt 
gccgatttca 
cgttagctct 
aagcagttga 
gagagaatcc 
ggagagaatg 
ctctctgctg 
ctctgccaat 
ctgtattagt 
catgtttatt 
tgctcagggt 
tttagggttc 
ggctgaggtc 
tgcccacatt 
ttcccttgga 
ctcttttatg 
gtcaactgtg 
ccagggatta 
caggacttta 
ctcagtaaag 
tttggacctt 
ccttgaagcc 
tgcaggtgga 
ttagtcttta 
ttggtattga 
cctgttgcaa 
gacagcttgt 
cccgagaatt 
gtgctggtat 
ttagggctag 
ttggggaagg 
aaggcactgg 
acaccacagt 
ccttgtgagt 
ttgaaagttt 
caattactaa 
gcagcagaaa 
gtagctcatg 
agaacaggtc 
ggatgttgta 
aactccagga 
ctttttggtc 
gcccagtcat 
aaaaatgaga 
gaaggggtga 
atttctcaca 
aggtacagaa 
ccagtcaatt 
ctgtatactt 
tcctggattg 
actggtttcc 



gttgattagt 
gggcagagtt 
tctgaaggag 
taaatgttct 
gggtgggtgg 
tttaaccagt 
acttgtcatg 
gatggtggta 
tggggacaat 
agcagagata 
ttactgcaaa 
gttcctggag 
gacgtgggga 
cttggtctat 
cagaacaacc 
ttcaccagct 
tcagccccag 
tggagcagaa 
gtgggatctc 
taccttcgtt 
ccctacaccc 
cttctattct 
agctcatcgt 
ctcacaaggc 
tcttccaggc 
ctgttttctt 
gcttgccacg- 
ttgaatcacc 
ggatcacctg' 
gcatgtaaca 
tgcacagtgc 
ttggtgaaga 
ttagaggttc 
caccctagaa 
tattcatagt 
ggtgagggct 
agctgtcatg 
caccatctcc 
cagcccactc 
accaggagct 
agatcagaaa 
ggagaggatt 
aaaggaggat 
aggcagggct 
accccaaggc 
tgtgatgcag 
accgttcaac 
gattaatgct 
ataccagttt 
ctctccttgg 
aaattgcagg 
tagggaggat 
gggctcagac 
actcaatctt 
actagctcca 
atgcgttagc 
ggcctgtcct 
ggtgactggg 
ctacccagtg 
agtttaagga 
tgagtctgac 
ccagtgaatg 
tatgacctaa 
aagaatgggc 



caattgatta 
ataggcagtg 
agagataata 
tgtggaaata 
gcttctttta 
acaaatgctt 
ccagtgagtt 
atgggtttgg 
ggcctgtgat 
gggagaagta 
gacatattcc 
gggaaagagg- 
ctgtgctggt 
gtcaattgtt 
gcccagtcca 
tttatctgcc 
agatcaggtc 
catgtaacta 
tggcctgaat 
attttccagc 
actgtaaatt 
gcataacaaa 
gagttcatca 
taaaatcaag 
tcacgtggtt 
gctgactgtc 
tgcgtggtct 
cacatggttg 
attagatcat 
tcacacaacc 
accaggggac 
acagtggaat 
tcttgccttt 
acatcacacc 
ttccaggttg 
gaatagtgtg 
tgttgggggt 
aaggtgatct 
ttgtagaaga 
acagcacact 
agattaaatg 
gactgagtct 
gcctaagatt 
gaggattaaa 
ctccagctct 
ggaaagaatg 
ctcatttatg 
ggtcacactc 
ctgtgctaaa 
ctgatctaag 
aagggccgga 
cccactgctg 
cctggatcaa 
gtcaacgcca 
gattcaaaat 
tgcaaaggaa 
ccaccaagat 
cagctcacag 
aggtaggtat 
acttgcccag 
ttggactgtc 
ttgatgaaat 
aagagagact 
tccctctcat 



attaattcat 
aacaagacag 
aacaagaaac 
aatgtgatga 
gaaagagttc 
tagcttggcc 
tgagcttgta 
gttaattgct 
acatgttatc 
gacggactgg 
ttctaagctc 
ataatgtcac 
tcgtttctaa 
ggggcatcca 
tatctcccga 
ccgtggggat 
cctggagtct 
gaagggccac 
caaacctccc 
agctcctcag 
tcaccgtggg 
ttgcctcaaa 
gcagtgtggg 
atgttgtctg 
gtggcagaat 
agatgagggc 
tttccatcct 
gactctctga 
acccatagag 
acaggagtaa 
aactgaattc 
gtcattcttg 
tgggaagtca 
atgttttcta 
aaaagctctg 
agctgcatat 
tgggcatggg 
ctgctctgcc 
gcagacccct 
cccccacaag 
cagaggtgat 
gtttaggaac 
caggatcctg 
cggagatgat 
tcgctcccat 
tgcttatcag 
tcattggcac 
aaagaccctg 
tgctgcatca 
tagaaaaatc 
aaaccagaca 
ctgagaccta 
tgtatcctgc 
ccgccagaga 
cttgaataga 
gctgaaaatc 
ccataggaaa 
catgcatgct 
tgtcatccct 
gacacgacac 
tgactccaga 
tttcagggtt 
tccctaggag 
caccttatga 



50880 
50940 
51000 
51060 
51120 
51180 
51240 
51300 
51360 
51420 
51480 
51540 
51600 
51660 
51720 
51780 
51840 
51900 
51960 
52020 
52080 
52140 
52200 
52260 
52320 
52380 
52440 
52500 
52560 
52620 
52680 
52740 
52800 
52860 
52920 
52980 
53040 
53100 
53160 
53220 
53280 
53340 
53400 
53460 
53520 
53580 
53640 
53700 
53760 
53820 
53880 
53940 
54000 
54060 
54120 
54180 
54240 
54300 
54360 
54420 
54480 
54540 
54600 
54660 
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cagtaatcct 
cgcagagccc 
ttctgtcttt 
aaatgcatcg 
atgaaggaaa 
tcaagctgtt 
ccaggtagga 
gacttccttc 
tttgacctca 
tgcttcactc 
atgggagggg 
ttacgtgagt 
atgtgccggg 
caccatcacg 
aaatgactaa 
atccaataag 
ttggaaaatg 
taatacttgc 
tctcgcccaa 
gttcaagtga 
atgcctgact 
gtctcaacct 
caggcatgag 
atgttaccat 
aagttagaca 
cccatcttcc 
cgctagacct 
ctttaataag 
aatgcaaagt 
cccctgcttc 
catcaatggt 
ttgtaaaact 
tcacagaaac 
taacattctg 
attgttttca 
tataggcgca 
ttttttattt 
accagtaaaa 
gcccaggata 
gtatcgggag 
ggtccttatc 
gttcctccaa 
tttgggaaga 
gtggagctgg 
agggctggca 
tcatcctctg 
ggagatggtg 
taccttctaa 
ttatgtcata 
tttacttccg 
gtttagaaag 
gcttttctgg 
gtcccctaat 
ctgtgttctc 
tttggggcag 
ctgttttttc 
tcgtcttcta 
tgaaaggatt 
aattttgtcc 
ctctcttagc 
ggtggatgat 
accaccttga 
ccttcccagg 
ctggggagga 



ctgtccaaca 
agctcaaagc 
ttaaatcaat 
ttcctcgttt 
ttacagtgtc 
ccttctactt 
atcagctcct 
tgaattagca 
atttactcat 
tgattttatc 
agagaagttt 
acgttactta 
cactgtacta 
aaggaagaca 
ctgataagga 
ggcagtgttg 
gggtatcaac 
catgaacagt 
gctggagtgc 
ttctcctgcc 
aatttttgta 
cctgacctca 
ccactgcatc 
aatgaaggct 
tacgtgaaca 
atcacagtca 
attagccaag 
gcagcctcaa 
tataaaaaga 
tggaggcaac 
cacataagga 
attcacatac 
atctcttcag 
tagaatagag 
tttctacaaa 
tttatgccta 
taataggtat 
caggggagat 
agaaccattg 
agaaatcttc 
agccttgcta 
ggcctctata 
tctaaagacc 
tgacagttta 
ggcgtctacc 
ccttgcacat 
gagacatgga 
gtcctatgtg 
gactctgagg 
ggcctgaagg 
aggtccttgc 
cccatcctga 
ggtgtgattg 
cgttgcagga 
tttctgctct 
cctttctatg 
tttctgaatt 
tctcaaggcg 
tttctccatt 
atcgccatga 
ggaaccagta 
ccttgtcagg 
cagagttgag 
acaaggggag 



gccaaagagg 
tcctgactag 
ggagacaggg 
ttctcattcc 
ctgtgcatat 
gaaatgcctg 
tggttcatgg 
gaggatgttt 
cttacaaatt 
tccacctgga 
ggtgccagaa 
accattaatt 
agcaccaagg 
gatgttaatc 
caagcccctg 
cggggagatc 
cacctacctc 
attttgttga 
agtggtgcaa 
tcagactccc 
tttttagtag 
aaagatccac 
cagccacttg 
ggtaggttgg 
tttcctgtac 
gctgggaagc 
cagaagtttc 
aattacagat 
agataaacaa 
aaggtcaact 
tgctcacaga 
taaatacttt 
gtttatagat 
aggacacatt 
tagtcaagga 
aaggatagtc 
ggacagattt 
atgtattttg 
cccagagctg 
actgaatgaa 
tctccctctg 
taagaagtct 
caggaaggtc 
ttctcccagg 
tgtggctttc 
tccttctgcg 
ggagggtgtg 
ttcctaggaa 
gacccattaa 
aaagccagcc 
gtcctggatc 
tttctattca 
tgatgtgtgt 
taaagccaag 
tcatctaag-t 
gtagatgggg 
tctgtgtgct 
tctgttatgt 
atgcttactt 
agtaggtggt 
cccaagccca 
cagcttgggt 
cactctgatg 
atgtgaagga 



tcctgtgggg 
gctcttgttg 
gagggttatc 
ttcccaatgt 
accaaggctg 
tttccttccc 
agccttttct 
cctagcttgg 
aggttgtaag 
tagcatcatt 
agaactggat 
acttcaatgt 
atacaatggt 
cattaaccaa 
gagctacaag 
aggagccaca 
actaggtttt 
ttgatgattg 
tctcagctca 
aagtagctgg 
agacaaggct 
ccacctcagc 
ccatgcatgg 
ccaactgagt 
ttgaagatcc 
ttttaaaaaa 
tgggcagggc 
tcagcacgca 
caatctgtct 
atttggtgtg 
taagcaccta 
cctcagtatc 
acaggtccag 
ttactcagtg 
ataacataac 
tcaaaagagt 
gttctcaaaa 
gaaaagcacc 
ttccagatgg 
tgagtgggct 
acagaggcaa 
ttgtgagagg 
tctgggtggg 
aggtccctgg 
aggttatgag 
ggatgtgaaa 
ggtggcttct 
agccaataat 
gtacaaacaa 
tcagccaccc 
ctggagcatc 
ggccttcttt 
gcatgtgtgt 
atgaaactcc 
aagtgttttt 
tcagagttac 
taaagggatg 
ctgtggtctt 
tgagcttact 
attgtatacc 
tgactgcccg 
tcccctcata 
cccagggcaa 
aggacactta 



agggcttgca 
agtattcctt 
tccatcctcg 
gtgtactgtt 
tccaacctcc 
ttctaattgc 
gctctgtttt 
tcttaaccct 
ctaattgaat 
tttgacacac 
ttgaattcta 
atatttatta 
gagtaaagag 
gtaatctcac 
agggtgtata 
cagagcctgg 
taaaatcagg 
attgaaacgg 
ctgcaacctc 
gattacaggc 
ttgcaatgtt 
ctcccaaagt 
catttaaaaa 
ggtctgattc 
tcaggacagt 
atgcagacat 
atctgcatat 
tttaccataa 
cctgctttct 
attcctttta 
tgcgggggtt 
ttgccttttt 
ctcttctttt 
tccgattgat 
tctgtaaaag 
gaaactgatc 
tgtttgtggc 
caaggcgatt 
cccctgggtt 
ccccagggaa 
actctctctc 
aagcaaagaa 
tgagtgcttt 
ctgtggctga 
gatgtcagca 
gtgctccttg 
tgaactctga 
cattgcttct 
ataagcgtaa 
ctcagggttt 
aggagctggg 
ttctccacct 
ctgtgtgtgt 
ccttacttct 
tgccttcagt 
acacccaccc 
gggactctat 
ggttctactg 
gagtgccttc 
catttcacag 
actctaagtc 
gagactgggt 
ggtgtgagct 
gctatcctcc 



gatgggagtg 
tgattcctgc 
gctcaagatg 
aactttagtt 
acacctttgc 
atctttccat 
actatgcatg 
tctccttttg 
acaggatcta 
aagcaggcat 
accctgttgt 
agtacctact 
atgcagcctt 
aagaaaagta 
cagggcatcg 
gttgtctcac 
ttaaatgagg 
agtctcactc 
tacttcctgg 
agccacccct 
gaccaggctg 
gctgggatca 
tgttcagtaa 
agaaggaaag 
gactcctaga 
ctgaccttca 
ttttaaaaat 
ccactgaaga 
tccctctcct 
gcattccctc 
ttttttttcc 
tcacttcatg 
catagccata 
ggatatcaat 
ttttattact 
aaatgtgcat 
agttcaaaac 
ctgaagtgta 
cctgaagtgg 
gtgatgaaat 
cctgggggaa 
ggacctgggc 
ctctgctgtg 
cagtttctgg 
ggggcagcct 
gctggggaaa 
ggaggggaca 
cccgcctttt 
tagtcccttc 
gctgcgttct 
cttggcatga 
cactcccacg 
caatgacaaa 
ggctcttcta 
ctttctttct 
ccttctttga 
ggccaggagt 
tgacattccc 
tctcctttaa 
aaatacagct 
catgctctta 
tccaggttcc 
gtctgtggtt 
ctgccagggt 



54720 
54780 
54840 
54900 
54960 
55020 
55080 
55140 
55200 
55260 
55320 
55380 
55440 
55500 
55560 
55620 
55680 
55740 
55800 
55860 
55920 
55980 
56040 
56100 
56160 
56220 
56280 
56340 
56400 
56460 
56520 
56580 
56640 
56700 
56760 
56820 
56880 
56940 
57000 
57060 
57120 
57180 
57240 
57300 
57360 
57420 
57480 
57540 
57600 
57660 
57720 
57780 
57840 
57900 
57.960 
58020 
58080 
58140 
58200 
58260 
58320 
58380 
58440 
58500 
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ctgagacttc 
caccagagca 
gctctggaag 
atatggtgga 
tccctgggtg 
cttgggtgag 
cccccagcat 
acctgagcct 
attttggtct 
ctcaagaccc 
ctgtagttta 
attaattatc 
ggcaggatca 
tcagccagca 
aaccagggat 
agagctaagg 
tatgtaactt 
gctgaggcag 
ccattgcact 
taaataaaat 
gagttggcaa 
tgcatagaat 
cccatcatgg 
atacggggcc 
tttctccagg 
tactgggctg 
ggtgaggtgg 
aggagatgct 
tctgactcca 
ggacaagagg 
ccactgcctc 
cagagctgcc 
ctccctgcaa 
cactctgctt 
catagaagtg 
atggagcctt 
tactgggcac 
cccagtttgc 
tggttggttt 
aatgaatgta 
gggcaccatt 
ctgccttccc 
aacacttgta 
gcttgccatg 
ttgggatata 
ttgagcaccc 
atcaataatt 
agacttgccc 
agcatgttac 
gggatgaagc 
caggacacca 
tctcaacccc 
atttttttct 
aggggtgaaa 
ctaaggtcac 
ttgttccaca 
agtgctgtga 
tctggtgtgg 
ttcccattta 
gtcttgttcc 
cagtccacag 
gggctcctag 
gaagtttccc 
tgaaaaattt 



cacctttgag 

ggagatggag 

tgaagatgcc 

caaaaacctt 

ccagacctgc 

tggcctatgg 

tgctatcact 

cagagaggtg 

ccaaccttca 

atgtgaattc 

cttgccggag 

gaatccagtg 

caggctcggt 

ctcagcttgc 

agatggacac 

atttagaatc 

aagttagctg 

gagagtcact 

ccagcctggg 

aaaaaaaaaa 

cccttgttct 

ctgatggtga 

tccagcaggg 

aggtctgctt 

gtcgctgcag 

ctcaccagcc 

ggctggggat 

acaaagagcc 

cagcagtgtg 

ctgaaagtcc 

agaagacttc 

ctctcctggg 

taaaatgggt 

agccctcatt 

tcagaagcta 

ttggagatgg 

ctgcatagtg 

atctctgtaa 

tatatgtaag 

taagtaccgc 

tcccagtggc 

ttcctcaact 

cgcaactccc 

cgtcagactc 

tggttatttg 

acaacatacc 

atgagcccta 

aagactgaac 

tttaaccctt 

acggggggag 

gcccagctgg 

ttctgtacta 

ccagcattcc 

caagaactct 

atagtaagtt 

agaatctgtt 

ggatgcagct 

aaaggcattg 

ttaagcattt 

tcataaatct 

agatcttctt 

tcccatcctg 

tgagcagtcc 

ttcttggccg 



acccctttgg 

gagacccctt 

tccaagaaag 

acgtgtcctg 

cgctacctcc 

ctgaggctga 

ggctcttttt 

ttggggtaag 

aatggttgtg 

atgtaggaag 

gtgctacagg 

ttctgtcagc 

aagagaagtg 

atcagatttg 

ccacagacaa 

aggaggtttg 

ggcatggtgg 

ggaacctggg 

caacaagagc 

ttaaaacaag 

accttttgtt 

taattgccgt 

aagggcccat 

ctctgccttc 

acgctttcag 

ctggtcccgc 

gaacgatgga 

tgaccctttg 

tgggtgggaa 

ctttctgttc 

ctttcatctg 

cagctgcctc 

tttactgaaa 

gaaacttctt 

ctactccttg 

aggaatgggc 

ctgccaggac 

ccaaaggcct 

gaagttaaca 

aatccccggc 

ctgctgtggt 

gcattctctg 

tctctcagga 

tgaatgaggc 

taacataggc 

ccctgctgtg 

ttttacagaa 

ctctaaatgc 

gtgcttttac 

gaaagccaga 

aggtgaggga 

gggaggtcag 

tccaattcat 

atttttttgg 

gagaaagaga 

gatttcaaag 

gaggtggaac 

cttcccttat 

actaggtgtc 

ctgccgctgt 

tgggggagta 

tatctcgacc 

actctatagt 

ggcacggtgg 



gtgctaagac 

gcagggagct 

atggggctgt 

aggaagagga 

tggtgagaag 

ggtgggagca 

cccattgagg 

aggggagggc 

gcaggggagg 

gatgctccat 

ggcaacctgg 

gcgctcaacc 

tgaacactaa 

tctgtttttc 

cactgagggg 

ggtccaagtt 

tgcatgtctg 

agacagaggt 

gaaactccgc 

accatgagtt 

agggctggaa 

ggatcagcat 

tgcccggtgg 

cctctgcccc 

tgggttgacg 

ggtggtcact 

aaggtctggg 

tgggagaggc 

aagaggccct' 

tgccttcaca 

ttcctactga 

ccctcctctg 

tggatttatt 

ccttatcatc 

aggaggagga 

tcagctagtt 

ctttcaaggt 

tttctctctc 

tgtccctggg 

ccctcaagtg 

agttggcctt 

ctccttccta 

tctccttctg 

ctgggaatac 

acaaaaacat 

gcaggcacct 

ccaggctcag 

aactcatatt 

agctggctac 

ctgtctcact 

gctttaatca 

cagaagaaaa 

agatcttcat 

agtgtggtat 

gagaaatcaa 

atggtggact 

tttcactcct 

ttcaatatta 

aggtactgtg 

gggtgtgtac 

atatcaagaa 

taggcatgtt 

gtgctttata 

ctcatgtctg 



gctgcctgag 
ggaggaagag 
tgagtctatc 
cacagtaaaa 
tcttcagacg 
tggaacgggt 
gccctggggg 
cacctacaaa 
gagggaatga 
tctttgtctt 
tttccatcca 
agggtcaagt 
atggggtgca 
tcctgtataa 
gctgcctggg 
cctttccatc 
taatcctagc 
tgcggtgagc 
ctcaaaaata 
tgtttcctca 
ggacaagcct 
ttcagatgac 
gcagcagaaa 
atcccttctt 
gcagccgctg 
gcgtggccct 
agatgggaag 
ttcctgggtc 
cctgtgggtt 
ggaggccact 
gctggtccca 
cttgccatcc 
ttctcctctg 
tctccccaca 
tggagggtgg 
ctcttcatag 
tgtaggtaga 
tctctccaac 
aacagtccac 
gaataaatct 
attccatgca 
ctctctgcaa 
gggaaacctg 
aagacatagt 
ctactagttg 
tgcctagatg 
agaagttagg 
gaaattcaac 
tctcccctta 
cttgggttca 
gaggggaggg 
taattcaatg 
atgggattag 
agagaaggga 
actcaggttc 
atgggttcat 
tgccctcttg 
acaacaaagg 
ctaaatgtta 
tttgacagaa 
ggggcacgaa 
tacattggtg 
ggagcacatt 
taatcccagc 



gatgaggaga 
gaggagtggg 
tcagtgccag 
gtggtgggca 
tttagtcaag 
gtgggatatg 
tgtcagtaga 
cagaagttgc 
attgtgggga 
ttatcctgcc 
caacttcaat 
ctggattgga 
cctgctgatc 
tctccagaag 
cattcaggga 
tctcactatc 
tacttgggag 
cgagatggag 
aataaataaa 
tctctaggat 
gtcactggga 
taggacagtt 
gagctggcag 
cccctcttgc 
gaactttgca 
gtgtacccga 
tgccccaagg 
ttttatatac 
gagttgggat 
ggcgtcgagc 
gccagcagtt 
ctccctccac 
atcgcggatc 
ccacaacttt 
agttgggtct 
aacacctgat 
ctcccaatgg 
cccagaactg 
aacattcagg 
aacatgtatt 
tttttatggg 
ctcccaaata 
atataagaca 
cctctggcac 
ttatcgctta 
acctcatgtg 
atctgtcaaa 
tctgctccaa 
tggtcacacg 
tcttgggaca 
aggaaggcat 
ttctaaagcc 
gggctcagag 
tgctacttct 
atttcaacta 
ccctgtggtg 
gactttatat 
gtaataatat 
ggtgaacttt 
gtttgacttc 
ggaagctgca 
cattcactgt 
gtacatccat 
actttgggag 



58560 
58620 
56680 
58740 
58800 
58860 
58920 
58980 
59040 
59100 
59160 
59220 
59280 
59340 
59400 
59460 
59520 
59580 
59640 
59700 
59760 
59820 
59880 
59940 
60000 
60060 
60120 
60180 
60240 
60300 
60360 
60420 
60480 
60540 
60600 
60660 
60720 
60780 
60840 
60900 
60960 
61020 
61080 
61140 
61200 
61260 
61320 
61380 
61440 
61500 
61560 
61620 
61680 
61740 
61800 
61860 
61920 
61980 
62040 
62100 
62160 
62220 
62280 
62340 
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gccgagacag 
aaccccgtct 
tcccagctac 
agtgagccga 
aaaagaaaga 
tgggctaata 
ctaagctctg 
cactgtgcag 
caagggccac 
gatnnnnnnn 
nnnnnnnnnn 
agagttatgt 
gttttcttcc 
ttttcccaaa 
gtttcttgta 
attgtgtagg 
gagactggga 
cagaatcatg 
gaagaatcaa 
tcacaagaat 
cccacccaca 
cacagccaaa 
acactattct 
gcagggacct 
gttgccacag 
cagtagtgtt 
cctgtcagga 
gagtcagtgg 
agcattattg 
gtttctgttg 
cttcgctgac 
ggttcctcca 
ctgctgtgcc 
accttcatcc 
acccttgcgt 
aagctggagg 
actgggtttg 
ttcatgcact 
c teg tat tea 
tgtcttccct 
tgtctggcat 
ggtcctgtct 
gtctgagcag 
caggggagct 
gggtccctga 
tttggcaagc 
cccatcttca 
aggatgggga 
gtgatggcct 
ttggggcaaa 
ctcacgaaac 
caggcccacc 
caggacctct 
ctctaggagg 
ttacgtgttc 
tactgeacag 
gtttaaatcc 
cacgatcttg 
ctcgatctcc 
ggcatgtgcc 
gctatgttgc 
tecaaaatge 
atetgaggeg 
gctggtgcct 



gcggatcacc 
ctactaaaaa 
tcaggaggtt 
gategtgeca 
aagagatttt 
ttccaaaata 
tgaaactgtt 
agcacagagt 
cctcactcaa 
nnnnnnnnnn 
nnnnnnnnnn 
tcattgattt 
tattattatt 
ggtaaaactg 
ccccttttcc 
tgtatataca 
agaaaaagag 
gcaggaggtg 
aagtggaaac 
agcgtgggaa 
atacgtggga 
ccatatcaat 
gcagagtttt 
ctcatgcctt 
cattcctgta 
acaaagagga 
gggtattgea 
ggttggagag 
atgagtttga 
ggacagtgga 
ttctgttggg 
ctttggcggt 
tgggcttege 
tgcaagtgat 
aagtggcctt 
taaagaaatt 
ggggcaggtg 
tgagecttta 
tcctctgagg 
tctttgtgcc 
gccatccatt 
gtttttccag 
agctgaagag 
ggggacaagc 
agctcactgg 
tctttgggct 
ccctcatcaa 
tccctggcag 
gggtatgtag 
agtatttgtg 
ttcaagcaaa 
ctggcctctc 
gtgetttcac 
tggtcgtgag 
agaggcacca 
etctatttge 
atcacttgga 
ctttgtcacc 
tggactcaag 
accatgccaa 
ctgggctggt 
tgtgattaca 
aatgacttga 
ctaccagctt 



tgaggtcggg 
tacaaaaaaa 
gaggctggag 
ttgcactcca 
ttctttttct 
tgggtttgtg 
tatctatggt 
aggggctcaa 
ctcatgtgcc 
nnnnnnnnnn 
nnnnnnnnnn 
cattttattt 
ttattttatt 
tgaaacaaga 
tcttctttag 
tgtattagtc 
gtttaattgg 
aaaggcactt 
ccctgataaa 
agactggccc 
attctgggag 
acatttccct 
tttttttttt 
ttaaccactg 
ctgatgacta 
tctggctaca 
gtaatccatg 
aagtgggcac 
ggtggggcat 
tggatagtgg 
ttggcctaca 
ggccctctgc 
cctctgtgcc 
cagccgctcc 
ggggeggget 
gggaggcaag 
tggagggggt 
catctgeggt 
cccacttctg 
cttagtactc 
tacatgectg 
ggccccgcat 
ggaaaggeca 
ggccgacagc 
aacccctctg 
ggtgatggcc 
aggctccctt 
gaggcctggg 
catggtgcag 
tgtgtgggga 
atcctgagac 
ccctaggccc 
ctctgtgtcc 
tctcctgctc 
gctaaggagc 
ctcatttttt 
gaaaagaaag 
caggctggag 
cgatcctctc 
gcttatttta 
tttgaattcc 
ggcgtgaacc 
ttgtgacatc 
cccatgtgac 



agtttgagac 
ttagccgggt 
aatcgcttga 
gcctgggcaa 
taaaaagtaa 
tgtgaatttt 
tctttatcat 
tcaacagtgc 
aggcatagtt 
nnnnnnnnnn 
nnnnnnnnnn 
tgtttctgat 
tgtaaaacat 
tatattcaaa 
gtaaccattt 
tgttttcatg 
acttacagtt 
ettacaeggt 
cccatcagat 
ccatgattca 
atataattca 
ctcttttaga 
gatgtaactc 
cctgggtctc 
tttggattgt 
tgttcagggt 
actgagttaa 
atttgaatga 
gtggggaaag 
ctcctcccct 
gagagcttct 
tcgacggtgc 
tcagtcccca 
ttcctctatg 
ctgtggagac 
gcggggcctg 
gagaccagag 
gccctctccc 
tttcagttct 
tagtcttact 
tcttttcttt 
gtgecttett 
gacaggtgtg 
ccccagaggc 
gtttctctcc 
ttgtcggctg 
cagaatgacc 
ccttaggcct 
cttcccaaag 
aacagacatt 
ctcaaaggtg 
acacagggtt 
ttacacctgg 
tttgcagaaa 
aaaagtcaac 
atttttaaag 
actaaatgtt 
tgcagtggca 
atctcagcct 
ttttattttt 
tggcctcaag 
actgtgcatg 
aggtgaccta 
cttgaacatg 



ctgcctgacc 
gtggtggcac 
acctgggagg 
caagagegaa 
aaatcatgaa 
cctctccagt 
tgaatccttg 
actcattget 
ctgagagctt 
nnnnnnnnnn 
nnnggtcact 
ttttaaagat 
ttacatatca 
gaagtttact 
ttattttttt 
ctgctgataa 
ccacatggct 
ggtggcaaga 
ctegtgagat 
gttaccctcc 
aegtgagatt 
taaaaggtag 
tatcctgagg 
cattacatgg 
ttccagtctt 
ggggaggggc 
tggtagttta 
tatgtaggag 
gattcgagga 
ttttccaatc 
ttttcctctc 
cttcgctggc 
tcctccctct 
ggagcaaege 
ggacacactg 
gaggcagtca 
gtggtgggaa 
tctgttttct 
ccagggaaga 
tectttgeta 
cctggtgcag 
ctgggttctg 
gttggagggc 
caggcttctg 
ccagtttccc 
tggtgtctct 
cattttacgt 
tggctgcccc 
ggtctgtgtt 
ctggagtagg 
tttcctgett 
tccacagttg 
agggatgetc 
ctgaggctca 
tttgaattct 
cagcaaatct 
ttttgttttt 
caatctcggc 
cctgagtagc 
ttgatagaca 
cgatccaccc 
gecaaaagag 
gtaatcagct 
tcattgaatg 



aacatggaga 
atgcctgtaa 
cgaaggttgc 
actccgtctc 
ataaggggac 
aagatactaa 
gagttcctta 
ttttcataga 
tgettaaget 
nnnnnnnnnn 
aatttggtat 
tgttttactt 
gacatttaca 
ttccctctct 
aaatataaac 
agacctatct 
ggcaaggect 
gaaaaatgag 
ttattcacta 
cccactgggt 
tgggtgggga 
tatactgtat 
gtgctctgta 
ctgeagcata 
ttgetattae 
agatgtgtag 
aagctaggat 
gtgaatgatc 
tgactcccag 
ttccttggcc 
tgttcgccca 
cctgacatcc 
ccagtacctc 
ggccttcctc 
gggcaaagag 
ggtgcgggag 
ggatagaaca 
acctggtgaa 
aatggaaaag 
gtgcgtgcat 
ectgeatgag 
tgggtcaaat 
aggectagga 
cttggaggga 
ttcagagcac 
gctccagttc 
gagtactggg 
aaatctggct 
attcaagtat 
gtggggaatt 
gtggtgagtg 
gccccaggga 
tgaggtcctg 
aagaggttac 
gtgttttgac 
tagaatagga 
gttttggaga 
tcactgcagc 
tgacactaca 
ctggggtttc 
gtctctgcct 
taaacttgaa 
gtgtattcta 
ctcgctaggc 



62400 
62460 
62520 
62580 
62640 
62700 
62760 
62820 
62880 
62940 
63000 
63060 
63120 
63180 
63240 
63300 
63360 
63420 
63480 
63540 
63600 
63660 
63720 
63780 
63840 
63900 
63960 
64020 
64080 
64140 
64200 
64260 
64320 
64380 
64440 
64500 
64560 
64620 
64680 
64740 
64800 
64860 
64920 
64980 
65040 
65100 
65160 
65220 
65280 
65340 
65400 
65460 
65520 
65580 
65640 
65700 
65760 
65820 
65880 
65940 
66000 
66060 
66120 
66180 
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ctctgtttct ttatctgtga aatgggcttg atattcctcc tctaccccaa ccgatagtgc 66240 

agaatgaaaa gtaactgaaa gtccttcctc cagggcacca tagtgtctgg gtgaaaagta 66300 

gaatataaac tcggtagact tctggtccct tcattggtca tggaatggac cagtgcttgc 66360 

ttcattgagc aacagttctg ttgttcagaa ttcctggatt tcacctcact tctgctctcc 66420 

ctgcaggtga atgtgatgtt catgcttgcc attcttctga cattcttcca cccctttctg 66480 

gtatatcggg aatgccgtac ttggaaagaa agtccctctg caattgcata gttcagaagc 66540 

cctcactttt cagccccgag gatggttttg ttcatcttcc accacctttg aggacctcgt 66600 

gtcccaaaag actttgccta tcccagcaaa acacacacac acacacacac acacacaaaa 66660 

taaagacaca caaggacgtc tgcgcagcaa gaaaagaatc tcagttgcca agcagattga ee J*° 

tatcacacag actcaaagca aaggcatgtg gaacttcttt atttcaaaac agaagtgtct 66780 

ccttgcactt agccttggca gacccttgac tccaggggag atgacctggg ggaggaagtg ^6840 

tgtcaactat ttctttaggc ctgtttggct ccgaagccta tatgtgcctg gatcctctgc 669 °° 

cacgggttaa attttcaggt gaagagtgag gttgtcatgg cctcagctat gcttcctggc 66960 

tctccctcaa gagtgcagcc ttggctagag aactcacagc tctgggaaaa agaggagcag 67020 

acagggttcc ctgggcccag tctcagccca gccactgatg ctggatgacc ttggcctgac 67 080 

cctggtctgg tctcagaatc acttttccca tctgtaaaat tgagatgaat tttggtgttg 67140 

aaagttcttc ctggagcaga tgtcctagaa ggttttagga atagtgacag agtcaggcca 67200 

ccccaagggc catgggagcc agctgacctg cttgaccgaa ggatttctga cagactatct 6 ;260 

ttggggatgt tttcaagaag ggatataagt tatttacttt gggcatttaa aagaaaattt .67320 

ctctcgggaa taattttata gaaaaataaa gcttctgtgt ctaaggcaac tactgtttcc 67380 

atctctctag gctttgggcc ggggctgtgt gtgtgtgtgt gtgtgtgttt gtgtgtatgt 67440 

gtatgtttct gaggaggccc taccctggca tgagagggta gggaatctgg ctacacatct 67500 

agtgtggcag ctggacccag aggtggggca ggaaccctga ctatgattca ccccgctggt 67560 

cctgggatgt gggcccagag acttcctccc* ccaggaaccc ctctgcttcc tcttcctctc 67620 

cacatcctta actaacttta gcagaaccct actcctcact acacaccccc agctagaagc 67680 

gctggatgga atcagaaatt cctagtttga gtttcaattc tgcccctcag cagctgggca 67740 

agccccttaa ccactctgag tcactagttc cccacctgca- aagtgcagtt aatcatttct 67800 

atctctgatg gcgattgtga gaatgtaaag tcattgcaac tgcctagcac atggtaggag 67 8 60 

cacatgaggg tttgctcctg tgtttactca tgacccttgg ggaggacggg ggcaaagagg, 67920 

gagaagttga gggtgcagga ggagagatgg caggtgggtg ggatgggaga atctggggca 67980 

cacctgctgt ctcattccca ccttgctagg agagggacta ggaaagaaca gtgggaggca 68 °™ 

gggggatggg ggtggaaggc agggggtggc aggcaggttc atccatccat tcattcaaca 68 100 

aatgtttatt gagcacctgc cacgtgtcag gccctgtcct gggtgctggg gctataaaga 68160 

tgcagaaggg tctgaaaccc agctcttcct tcttcctgtg gatgtcgggg tgtaatttcc 68220 

aggggccagg agcctgggtc tgagggcgga caccaaagtt ctagtggtgt ctattagcag 68280 

cgtttaaatc taatggatgg atttggtctt gttaccctgc tcaaaagctt tcagcagctc 68340 

cccactgtcc acaggacaaa aatccagatg ctagcctggc attcaaggct gtcactagtg 68400 

tgatctcaac ctctcccctt ccctctttac ctcctaccaa cagcggggca gagcccaccc 68460 

ctgtggacca agattcccag tctctgggtc tgtgtgtgca ccagttcctc tgcgtgggtg 68520 

gctcaccctg cctcagcttg tgaaatccat ctggtctgct gggatcctgc tcaaaatgtc 68580 

atcttctcca aaaatcatta ctcaggcttt ccagcatgtc tgagtccctg gcacttggtc 68640 

acacccttcc tggtgactgg catttgcctc cacatcatga ccctcccacc ccttgcctgg 68700 

gcagcatact ccaggaggca aggtctgttc tcgcctggct ctaattaatc tgtgcttacc 68760 

atccacatgg taccagctaa ttcttgttga atgaatgatc gttgaatgag tggattcttg 68820 

ttttggcctc agaaccaatt agaaggagcc agaaaaacac atgggggtgg gggaggtgca 68880 

gtgtggtgca gtggaaaaaa acccttctgg aaatctcagc tctgtcactt actttgtcag 68940 

ctctgtgact ttggatggac cacttctttg tcagtatggt gggagaaata gacatgcctc 69000 

tctgggctgt tgtaaggatt acaaattagg tcgagtgctt ggcatgtggt gggttgaaca 69060 

gatcacagct agcattacag atgatatatt aaagccaaaa aaagatgcct aatgtccacc 69120 

agttggtgaa cggacaaagg aaatgtacca tatttgggat attatttggc aatcaaaaaa 69180 

agtactgaca cctgctacaa cacggatgaa tcttgaaaac attagactaa gtgaaagaag 69240 

ccagacacaa gaaactgcta atgattccat ttaaatatga aatatcgggc cagggtgcag 69300 

tggctcatgc ctgtaatccc agcactttgg gatgccaagg tgggcagatc acttgaggcc 69360 

aggagttcgt gaccagcctg gccaacatgg cgaaaccccg tctctactaa aaattagccg 69420 

agtgtagtgg catgcacctg taatcccagc tacttggttg gctgaggcac aagaattggt 69480 

tgagcctggc aggtggaggt tgcagtgagc caagatcgtg ccactgcact ccagcctgga 69540 

tgacacagtg aggttccgtc tcaaaaaaaa aaaaaaaaaa ggaaaaagaa aaaaagaaat 69600 

ttccagaata ggccaatctg tagaggcaga aagtagattc atgattgggt aggcctgggt 69660 

gtggaggcca tgggtagtga tggctaatgg ggaaggggtt tcttttgggg tgatgaaaat 69720 

gggtggactt atggtatgtt aattatacct caataaaact gttatttaaa ggaagaaaag 69780 

atgcctggat tccccaggaa gtgtacagta gacttctgtg agaatcagaa atgatttctg 69840 

gggaagatgg gcgagaggag agtaagtggg agaagtgacc acgtgcgcaa ctctcatcgt 69900 

tctgccctga gagccttcct cctgcaactt tatttattta tttattttga aacaggttct 69960 

cactctgtta ccctggctgg agtgcagtgg tgtgatctca gctcactgca gcctcgacct 70020 
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gccaggctca 
caccacatct 
cactctgttg 
tcccaaagtg 
cctcaagtgg 
tcagcaacgg 
tggaggtgaa 
ggagaacaca 
ccttttaagc 
aaagtcaggt 
tcccaaggtc 
tcttcttctt 
tcttcttctt 
tcttcttcct 
cttcttcttc 
tcttgctcta 
ctggccttaa 
caccatgccc 
aaataaagag 
tggttaacat 
tgatctcata 
ctgctcacaa 
tccctttccc 
ttggtcagca 
gtgggcttat 
tcactatcca 
aggtgctgct 
gcgagaggga 
gagccttcct 
ctcactctgt 
ctcccaggct 
tgtctaggag 
ggtgtgacat 
catccaggct 
aatgtctgag 
gattttgagg 
tctctgtcgc 
cggttcaagc 
ccacacccgg 
tggtcttgat 
caggcatgag 
tgctcacagg 
gctggctaga 
tgtgacctta 
cagtttgttg 
ctggagatgg 
gattcctggt 
gagttgtaag 
agtgagtgat 
gttattagta 
ctggaatgcc 
tgtctccacc 
gcaaaaaaaa 
atcaatcttc 
tccaagcaca 
gggctgggga 
tgtccttgta 
tcagagagta 
taggtccttc 
tcaagtacgc 
cgctctgtca 
tcaagcaatt 
gccagctaat 
ggtctcaaac 



agcaatcctc 
ggctaatctt 
ctcaggctgg 
ttgggattat 
aaccaatgag 
gcttccaatg 
ttggccccca 
tgccgtgtaa 
ctattcagcc 
tcacccatta 
acattcattt 
cttcttcttc 
cttcttcttc 
cttcttttct 
ttcttcctct 
ttgcccaggt 
gcaatccccc 
agcctgcatt 
ggacatgaaa 
gttggtatgt 
agcttttcct 
ccccatctcg 
cttccctgcc 
gattggagag 
cacaaaagag 
taatctggtg 
aagaacactt 
gtaagcagga 
cctgccacgt 
cacccaggct 
caagcagtcc 
ccctttttcc 
gggcacagca 
tccaggcata 
tgtatcttaa 
tgttttgttt 
caggctggag 
gattctcctg 
ctaatttttg 
ctcttgacct 
ccaccgtggc 
cccctcagca 
gcagtagggt 
accaccctga 
caaggactag 
actgtccaag 
acctccgctg 
gattgagtgg 
caatataaac 
acatctgtgt 
actattatag 
gaacaaaaca 
aaaaagactt 
atattcaatt 
cagaaatgtc 
agagtgcagg 
ggagtcatga 
ataaggataa 
attaatggca 
tacatgacag 
cccaggctgg 
ctcctgtctc 
tttttttggt 
tcctgacctt 



ctgtttgagc 
ttatttattt 
tgtcaaacgc 
gggtgtgagc 
gaagcaagca 
caacccttct 
tcacaccccc 
gggacagcca 
cagggaagtg 
gcacattgtg 
agacattacc 
ttcttcttct 
ttcttcttct 
tcttcctctt 
tcttctttct 
tgaatgcagc 
tgcctcggcc 
tattctcttg 
aagacgcatc 
tccctcccgt 
tatgtaatca 
tggttctgcc 
tgtgtggggg 
tgcaaataat 
caccttggga 
ctaactgtac 
ctattctgtg 
gaaacaaccc 
ggttttgttt 
ggagtgtagt 
tccccaaatt 
tgccagcccc 
gtgagtcatt 
agatccagtc 
acccaggacc 
gtttgtttgt 
tgcagtggca 
tctcagcctc 
tatttttaat 
cgtgatcctc 
ccgcccaatt 
tattctttcc 
gtgtgggaaa 
acctcagttt 
agagtaacct 
accaacttcc 
cttcatctgt 
ctggatgaac 
cacctggctt 
gcccttcaga 
ttcaggatag 
acacaagttc 
tcgagttacc 
tttgcgatac 
aattaccagg 
cagacagatc 
agcatcttag 
gtaaggctct 
gctactaatt 
tgaaacagtt 
agtgcagtgg 
agcctcccga 
atttttagta 
gtgatctgcc 



tcctgagtag 
atttatttat 
ctggactcaa 
cactgtacct 
actcagagct 
ctcctgtaac 
acagtgccaa 
acagcataaa 
catatgatca 
gggctggtat 
tttactttac 
tcttcttctt 
tcctcttctt 
cttcctcttc 
tcttcttctt 
atcatcatac 
tccaaaagtg 
taagaaagat 
accattaatt 
catttgactg 
aatagtagcc 
acgccttgtt 
tcctagatga 
aaaaacactc 
tattccagag 
tttagctgaa 
agaatcagag 
acaggcacag 
gtttgtttgt 
ggcaagatca 
caaagcttgg 
tcaggggatt 
cctctgacat 
ctctggtgat 
atggcttgtg 
ttgtttgttt 
cgatctcagc 
ctgagtagct 
agagactgtg 
ccgcctcggc 
ttgagttttt 
tgggtccagt 
gctctgggct 
cctcacctgt 
tgggaataaa 
agttcctccc 
aaaatggagt 
gtcaagcact 
gtagcaggtg 
gcgtgcacca 
aaaacctccc 
tttattactt 
ccaaatctta 
ttttgtctcc 
ggcagggcta 
tgggtcctgt 
tgcctttgtt 
acgtgaaaag 
tttattacat 
tttttgtttg 
cacgatcttg 
gtagctggga 
gagacggggt 
caactcagac 



ctgggactac 
ttatagagat 
gtgatcctcc 
ggcacctcct 
ttcacaagtt 
cagcctcagt 
gctgggccct 
ataggaattg 
gccccatttc 
ttaaaccagg 
atttcttctt 
cttcttcttc 
cctcttcttc 
ttcctcttct 
cttttttttt 
ctaaatgcag 
ccaagatttc 
atcatttaaa 
ggaccactca 
gatgtatgtg 
aaaaacatga 
tatccccatc 
cggtgagcca 
agaaggcgag 
aatgacctca 
ggtgctggca 
atgatttcta 
ctctcatctt 
ttgtttgttt 
tggctcactg 
agtgatggtc 
gatgactctc 
tctttgggaa 
aaggagttca 
ttcacaccag 
gttttttgag 
tcactgcaac 
gggactacag 
tttcaccatg 
ctcccaaaat 
atgttctaat 
gtcacctccc 
ttgcaggcac 
aatggaaata 
aggtagcagc 
cacacaagct 
aacaatagga 
tcaaagggga 
tgctgtgtgt 
cacttcacac 
tgcaagcact 
ggaatgggaa 
agccaaagtc 
ccagcagtca 
tgaattcctt 
tatcacgttc 
tgctacctat 
tgctcggccc 
acgcaaaatc 
tttgttttga 
gctcaccgca 
ttacaggcat 
ttcaccatat 
tcccaaagtg 



aggcgcatgc 
tggggagtct 
caccttggcc 
gcaacttctt 
ttgatttcaa 
agagaggaac 
tccatcaggg 
tgtgatgatc 
atagatgaag 
tctgtctggc 
cttttcttct 
ttcttcttct 
ctcttcttcc 
tcttcttctt 
tgaggtgggg 
ccttgaactc 
aggcatgagc 
acagacgaga 
gagataatca 
ataatttaaa 
ttttaaatgg 
caccccctac 
gagggcagcc 
ctgttgtcaa 
tacccgctaa 
ggtcctgccc 
gggaaaatgg 
tctgccctga 
cagatagggt 
aagcctcgac 
ccagtggtta 
aaatgcttca 
gaacattttc 
cagacaggac 
accctccagg 
acagagtctc 
cttcgcctcc 
gtgtgcacca 
ttggacagga 
actgggatta 
cccaaacatc 
aggcctgcag 
tgatcagctg 
ggtaccacgg 
agcttgggct 
ctggcactta 
atactttata 
cctggcatgt 
ggctgcaggt 
cttgtggagt 
cgctttagct 
aacttcaaag 
aatgaaaaat 
atggagagaa 
tcagagccct 
ttagattggg 
aatgcctacc 
tggcacatag 
acattacagg 
gacagagtct 
acttctgcct 
gtgccaccac 
tggccagact 
ctgggattac 



70080 
70140 
70200 
70260 
70320 
70380 
70440 
70500 
70560 
70620 
70680 
70740 
70800 
70860 
70920 
70980 
71040 
71100 
71160 
71220 
71280 
71340 
71400 
71460 
71520 
71580 
71640 
71700 
71760 
71820 
71880 
71940 
72000 
*72060 
72120 
72180 
72240 
72300 
72360 
72420 
72480 
72540 
72600 
72660 
72720 
72780 
72840 
72900 
72960 
73020 
73080 
73140 
73200 
73260 
73320 
73380 
73440 
73500 
73560 
73620 
73680 
73740 
73800 
73860 
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tggcatgagc caccgcacct ggctgtgaaa cagttttatt gtgtttctgt ggaatgtgtc 73920 

ctacccaacc tatagctaac tcctatagtt ccctcagttc tcagctcaga tatcccttcc 73980 

tttctgtact gttacctagt actggttttc atagcaccag gtacctctct ggcatagagc 74040 

ttgtcacagt tgcagtttaa tgtaccatca taggatttta aaaatattca gttgtgtctt 74100 

ccattaggct ttcatttggg aactccacgc aggcagcagc tgtatatttt gtattgccta 74160 

ctgtatcctg agaactttgt accctactta gcacagaatg gaggctcagt aaatactgga 74220 

catgagagag agagagagag agagaggaga gggagagaga gagagagaga ttcaacctac 74280 

aatcccagct ctgagcttct agttccctga tggtgaggac tgtgatgtgt ctcacacggt 74340 

aatgagcact tatgcagaag aggctcagaa aatttctcct catggccaac ggaagactta 74400 

gagttctttt ccaagctcca ccgtttgctg gcatgcaaaa tttggactat cacttaagtt 74460 

ttccaagcct tgctttttct atccctaaca taggacaata ttcagcattg ttgtttgttt 74520 

gttgggggca ccatgtttca ggcacttagt agattattgt accaccacat ttcaattggt 74580 

cctcctcaag ccctgcaaca tctgtgaggt ggtcatcctt aacaactcac agatgagcaa 74640 

caggagactg gggggatgag ggaactgcca aggaggtcca gcttatgggc agcagagcca 74700 

agaatggaac cagggtcttt tattttttta tttttttatt tttatttttt aaccagggtc 74760 

ttttaacatc cgaggaccac attctttgtg ctttccaaat catcacctgc cccatgcaac 74820 

ttacagggta agttacatta aacaacgtat gtaaatggct ttgtgctagt tattcaccac 74880 

cacaggggaa gtgagtcacg gacaagagtg cagccgctcc attcggatcc tggctctgac 74940 

acttacctgg aaaatgactt aaccattccc aggatcagct gtttgtctgt aatttaggta 75000 

gtttaatggc acttgtgtcc tagagttgtt tagaaggttg aataatatgg agcacttaac 75060 

atacttagca cctagaaaca cttcctaaat attagttgct gctgttgtta tcgttattaa 75120 

aatttctgcc taagatctca tttcagggag cccaactcaa tctttgacaa gcttaaacaa 75180 

aaattgcttt tcttcattta ttcacttaca cagcaaacat gaattgagcc tgtactgtgt 75240 

ttccagaact gtgcaggacc agagaggcac aggtgaagga agcaaggctc tggctctact 75300 

ggggaaacag caagaagatt gctacaatga ggtgggaaga gggctggact agagagaagc 75360 

cctgattagt gtccttgcta cctttctctg ggagagccaa ggcaggcttc ctggaagagg 75420 

tgatccttgg ctgaaacttc gatgaagaaa aggaaagagc- gcagtggtta gggaggaaag 75480 

ggcattctgg gcagatgaaa tgacatgtga caaaatatgg gtgatcannn nnnnnnnnnn 75540 

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 75600 

nnnnnnnnnn nnnnnnnnnn nnnnnnngca ctccagcctg ggtcactgag tgagagaccc 75660 

tgtctcaaaa aaattaaaaa aaaaaagtcc agaagaacat ttgggtctca ctctgtggcc 75720 

caggctggag tatagtggca caatcatagc tcactgcaca ttcaaactcc tggcctcaag 75780 

tgatcctcct gccttagcct tgaaataagc ttggattaca gatgagccac cacacccagc 75840 

cagaccatta ttcataatag ccaaaatgtg aaaacaaccc aaatctccat caactgacaa 75900 

atggataaat agaatggtgg ttgatccata caatggagta tttactcagc aataaaaaga 75960 

agtcctgata catgctacaa ggatgaacct cgaaaacatt atgctaagtg aaagcagcca 76020 

atcacaaaag gctacatatt acaagattcc atttaaatga aatgttcaga ataggtaaat 7 6080 

ctaactttta tcacaggcaa agctatgaca ggaaatagat gagtggttgc ctagtgcttg 76140 

ggggcagagg tgggggtgag gcgagtgagt actgctaatg gtacagagtt acttttgggg 76200 

ataaagaaac tgttctgaaa tggactctgg tgatggttgc actactctga acatactaaa 762 60 

actgttaaat tatatacttg aaatgggtga cttgtgaggc atggaaatta tatcttaata 76320 

aagctgtttt acatatttta catatttaaa aatgcaggtg gagggatgag ccctctaaag 76380 

agaagcagga gtttgaggag gttctaaata ttgtgtggtg ggtactgagg catataaatt 76440 

tgtcagacct catcaaaatg tatgatgtaa tcttcaagaa agttgatttt aaagaaacac 76500 

caccagcacc aggtggagaa ggcaggaaga agttacacaa ggggtaggcc aagagtggtg 76560 

gctcatgtct ataatcccag cactgcggga ggccgagctg ggtgggtgac ttgaggtcag 76620 

gtgttcgaga ccagcctggc caacatggtg aaagcccgtc tctactaaaa atacaaaaat 76680 

tagccagacg tgctcgcgtg aacccagggg gagaaggttg cagtgagcga agatcatgcc 76740 

aatgcactcc agcctgggtg acagagtgag actctgtctc aaaaaaaaaa aaaaaagtta 76800 

cataggggac agtggcaggt gtcaagggca ggcagggtct ctcctatctc caggataaac 76860 

tcatagggga cttagatgcc atgtgggtcc ctaatagccc tccacttggt tcttgcagcc 76920 

actcttatgt gtatcatttc atgtcaggcc tcttcttccc aacccaccca gccatcccag 76980 

cctggctgcc aaccccacct cctccagccc ctgtcacccc ataattgggg ccaggaggca 77040 

tgggagagtc gccatctctc ggtgccatct gttgcatctt tacagataac catggctgga 77100 

tgcggcagat cctggggtgg agcagccgct gttcagagca gtgatcaaga cctccccatc 77160 

tccacccctc aaggaatcgg ttttcttcca tagccacatc aggtgctgtg caggaaggag 77220 

ttgaaacgag aagccaggag caacgagaag gacactaaca tttattaagc actgcagact 77280 

ctcacagcac tcccacggaa tcgatattat tatccccatt ctgaagacca ggcaactgaa 77340 

gctcaatgtt taaggaactc accgaagtca ccaactgata aaagtgatgg aagctgggat 77400 

tcaaatccaa gctaaacttc cttccaagct tactccacaa cacagaggtt ggggaaaggg 77460 

gataaaaaga gaggggagcc caattccatt tccacccagc tcctgaggcg gagcttgtca 77520 

gcacagctct ctccttccca gaataggaag atacccatca gaggcaagtc ctagacacca 77580 

gcagtggtaa ctccctgccc caaggcagct gcagacagcc tatggctgta gttactgctc 77640 

ccaaagagtg ttagaattcc cactcccagc ttcggggcca ctcacacaag gtgattgaag 77700 
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tggaaaccag 
caaatgagta 
cctttatatt 
aaattcctct 
aggactctgc 
ccactggatt 
gaagaggtcg 
ctaacctttt 
attttcaggc 
cagggagccc 
ggccatccac 
cctgcacaga 
catcatggtg 
catagagccc 
ccagaggtgc 
gtctgcgcgc 
agggccgggc 
gcagagcgcg 
ctagaagctg 
gcccctccaa 
gtatcaggtg 
ggagacccag 
gcagggctcg 
gagggggctc 
gcggatgggc 
cgccgcggcc 
agcgacaact 
agaggggagg 
ggggctttgg 
aagcagagga 
gaaaagaaaa 
gcaagtaacg 
tctcagccga 
ctccggtcac 
tcgcggcgcg 
tctgcgtgca 
cgctgtctcc 
agctgggctt 
ctgcgccccg 
acgcgcacgt 
tacatttcca 
gccctctgta 
cagggccgcc 
agagtggcac 
tgactaccct 
aattcggtta 
aaatgctaag 
tttcagaaga 
ctctcagcct 
taatgagaag 
atgctgagct 
tttacctacg 
aaaatttttt 
cccagaactt 
ctggacaaca 
ggtgtgcacc 
gagatggagg 
aagactctgt 
taaaagggaa 
agacagggca 
atttaaacct 
tagaggtagg 
ccagaaggag 
tggccaaagc 



agactctcca 
atttgaaaac 
catactccac 
cccagcacgt 
ctattagtcc 
agataaatga 
agaagtgagg 
ctggcttcca 
atacctgggc 
cagcccagga 
cagcgcctcc 
aacagagcgc 
gcacccgaga 
tgggcttccc 
acgcggcacg 
tgcagcttcc 
tgctggggag 
ctctgaagca 
gcaggaagaa 
gacctctcct 
acccacgacc 
gacgggctct 
gggcaccagg 
ggacttcggc 
tggcgggcgg 
gctgcagcct 
gggatgagac 
aagtaccagt 
gttcaagcgc 
agtgcagcga 
gggaaggacg 
gtctttcctt 
cgcatagccg 
cggcaaatga 
tccggagtcc 
ggacccggcc 
tcggaagaag 
ggatgaagtg 
aggtgggggc 
gactcgcccg 
ccctggaccc 
tccttgttca 
ttgggtgggg 
gggtttcttt 
gggttaccta 
aaatggaacc 
catagcattt 
cctggtgatc 
ccctgagcct 
taatctcatt 
taatacctag 
taacaaacct 
taaaaaaaga 
tgggaggcca 
tggtgaaacc 
tgcagtctca 
ctgcagtgag 
ctcaaaaaca 
cccctgaaca 
ctttccaaaa 
catagctttt 
atgcagagtc 
aatggcaaga 
acatgtcagg 



caatgccctc 
ctgggggctc 
tttgtgcagc 
ctccttcaaa 
cagggtctgg 
aacagtgggc 
ctgtccaact 
cctgcccctt 
acgtgctgga 
gtacagcaga 
ggtacgcctg 
tgggtgaagg 
cctctcgggc 
agccggtgcc 
gggctcccgc 
tagcaggctg 
aaagtccgca 
ccgcgggccc 
gggcaagttc 
tccctctggg 
ctacagtctc 
cctcggttcc 
ctttgcacct 
aggaagtctg 
gtgtgtttac 
cggcgggagg 
cgaggaaagc 
cacttcttcc 
tccaggaggt 
gcggggaccc 
cgcttggcga 
gggaagcctg 
gagaccctac 
gcagccagca 
cgtgggccca 
gccacggagc 
cgggggaact 
gatctgcgga 
gggggctccc 
ctggtcccgg 
ccatcttctc 
aaacaaagtg 
ttttgaactg 
ggagttagtc 
gggcaagtta 
tacctaactg 
ggcatagtaa 
ggatgcttcc 
cagtatcctc 
acatgtgagt 
gtgatgggtt 
gcacattctg 
gtgatactgg 
aggcaggagg 
ccgtctctac 
gctacccagc 
ccaagatcat 
aaacaagaat 
gatgagccac 
taaaaatact 
cacagaggga 
aggtcggcct 
gcaaaggctt 
ctctgatggt 



ctagagtaaa 
ccagctcctg 
tctcccttgt 
gagctctaga 
atgaccagca 
agagtgccac 
gctgaccctg 
tgccatccct 
atagaagccc 
gaagaagagg 
ttgcagcgtg 
gccccccagt 
cagcccgcga 
aaggagctgg 
tgagccacta 
ccgggttctc 
tctgcccagg 
atgtccggac 
aaggctaccc 
gccgggcgac 
tcgggccaag 
ctcctccccc 
cggaacccgc 
gcggctgctg 
caaagggagg 
agagggaacg 
ggagaggaga- 
agggggactc 
ccgctggaac 
gggaggaaga 
gatgggacac 
gcagtgtcgg 
gcgcgccccc 
gctgcggacg 
gccctgagcc 
ttcagcctga 
gggaacccgc 
gttgatagtt 
ttcttttccc 
acactctccc 
ccccaagccc 
cgattcggct 
tgcagctgga 
agatcgaggt 
cctctctgag 
cccaaaggaa 
gcataatgtt 
agatcaacaa 
atctgtaaaa 
ttaattgtgt 
gataggtgca 
cacatgtacc 
tggccaggtg 
atcg.cttgag 
aaaaaagaaa 
aggctgaagt 
gccactacac 
gactacagaa 
caagccaaga 
gttattaatt 
gaaactgcag 
ggcatttaag 
agctgtggga 
ttaacttctt 



tgaggctatg 
aaaagggaag 
cttatgatag 
cctgaggctg 
ggacacctgg 
ccaatctccc 
ctttctgtcc 
ccccccagcc 
tcgttcttca 
ttctccagca 
ggggccatgc 
ggccccaggg 
ggagcccctc 
ctccgcgcgc 
tcggaaacaa 
tcacccaggc 
tccccagagg 
tctcgcgcca 
tacgacccca 
agcaagccct 
ccaacagctg 
gcgcgcccct 
ttgcccccct 
actttataag 
gaaagagccc 
cgggcagcgc 
agggcaagaa 
ggtattctca 
tctggcaaac 
gaagagtcgg 
tgtgccgcgg 
cgggagccgg 
tccccgccca 
cctccgggag 
gcgccggcgc 
cagcccggtg 
cgggcgccag 
gtatttacac 
ctccccttag 
tctggcacag 
ccagactaac 
gaagccgact 
agcagtgttt 
ctgagtcttg 
cctcagcttc 
tcgcgattgt 
aattgttgct 
ttgattgact 
tagactacta 
gttaagagtg 
ataaaccacc 
ccagaactta 
tggtggttca 
ctcaggagtt 
aaaaaatagc 
gggaggatca 
tccagcctgg 
agctccaaga 
gaggaactaa 
cctcaagaca 
gcttgaagct 
tacggctcct 
atggcacaag 
aaaatgcaat 



taactttgtc 
gatgtggggc 
ccctattaag 
tcagaggctt 
cattcagtga 
cctgaagttt 
acctggccac 
cacccagccc 
gaatgatcaa 
cagccgtgca 
tggccccgag 
aagggtcctg 
atggaggccc 
actagcagtg 
ggaaggtcct 
cagggcgctc 
acagcaaggg 
ggaaagaccc 
tcttccagtt 
ccccctttcc 
ccacgtggag 
cactcactcc 
ccagccccgg 
ggcagcggtg 
cagctccccc 
gggggcgggg 
agacccagag 
tctgtgaaac 
gcgcagctct 
aggggtcaga 
gaccgcgggc 
cctcggtgtc 
cgctgctcac 
cgcaacgctt 
tggggtcttc 
gcctcgcctc 
aggtctgcga 
gcgtccggag 
gtcgagtttc 
ccccagcacc 
atcaggcagc 
gaccgcgatt 
tccgagaggc 
actttttaac 
ctcctcttta 
gatgcaggta 
gctgtcatta 
ccaggtaaat 
tggtgtggag 
ctgctaatgc 
atggcataca 
aaataaaaat 
tgcctgtaat 
cgagaccaac 
caggcatggt 
ctgagctgga 
gtgacagagt 
aggcctcaga 
tggctaccat 
tcatggtccc 
ggagcaaggt 
tccattcctc 
gagttctcgg 
actgcctccc 



77760 
77820 
77880 
77940 
78000 
78060 
78120 
78180 
78240 
78300 
78360 
78420 
78480 
78540 
78600 
78660 
78720 
78780 
78840 
78900 
78960 
79020 
79080 
79140 
79200 
79260 
79320 
79380 
79440 
79500 
79560 
79620 
79680 
79740 
79800 
79860 
79920 
79980 
80040 
80100 
80160 
80220 
80280 
80340 
80400 
80460 
80520 
80580 
80640 
80700 
80760 
80820 
80880 
80940 
81000 
81060 
81120 
81180 
81240 
81300 
81360 
81420 
81480 
81540 
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agaacttcca 

tctaagctag 

gtggctcatg 

caggagttca 

ctgagcatgg 

cctgagcctg 

ctgggtgaca 

tactcttgac 

caaggttttc 

acacttgctc 

ggacacaggc 

acctcaccta 

ctcccagtct 

tttttttttt 

ctcagctcac 

agtagctggg 

tcttttttcc 

tcggctcact 

gtagctgaga 

ggcggggttt 

cgttggcctc 

tattttttta 

ctcacgtgat 

cgctcagcct 

cttctctctg 

tactctgctt 

gcccccatcc 

tgatttgtgt 

ttgttgaata 

gtttggaatt 

ctcctggatt 

aggtttttgt 

aggagaccac 

tggctcactc 

aggagtttga 

ttatccaggc 

aatagcttga 

ggtacatgga 

aaacaccgca 

ggggaacatc 

cttagaggac 

aacctgcaca 

agaagaaaca 

ggtgactcag 

ccaggagttc 

aacaaacaaa 

ggctgaggtg 

gccactgcac 

taataaaaaa 

gatttatcaa 

acactactca 

ctagaggctg 

ggctctgcag 

tcttctagtc 

ttatttagcc 

gtaatcccag 

cagcctgacc 

ggtggcatgt 

caggagacag 

agcaagactc 

gtgcagccat 

ctatagccca 

taacacaaaa 

caaacatccc 



gatcaaggtc 

gagagtcatg 

cctgtaatcc 

agacctgctt 

tggcatgtgc 

ggaggtccag 

gagaccctgt 

aaaaatgctt 

atggaccctt 

ttcctcatct 

ctcatttggt 

aggggcactc 

ttgcacctgc 

ttttgacaga 

tgcaacctct 

attacaggtg 

tgagacagag 

gcaagctcca 

ctacaggcgc 

caccatattg 

ccaaggtgct 

gtagagatgg 

ccacccacct 

ccttcttcat 

gaagctctac 

acctctatgg 

cgccccatga 

gcctgctgcc 

aatgaaaagg 

acctctggga 

ttgttttctt 

gtttaattaa 

cataatagtt 

ctgtaatctt 

gaccagcctc 

gtggtggtac 

atctgggagg 

tgaagctgga 

tgttctcact 

acacactagg 

gggacaatag 

ttctgcacat 

aacaaacaaa 

gtctataatc 

aaggccagcc 

caaaatagcc 

ggagaatcac 

tccacctgga 

cagagaagag 

cgttctcctc 

taggttaagt 

cttcacagct 

ctgattattt 

tgttaagtgc 

taacaaatag 

cactttggga 

aacatggtga 

gcctgtaatc 

aggctgcagt 

tgtctcaaaa 

ttcggaaaac 

gcaattccac 

atgtgtatac 

agcagtccat 



aaactcctca 

gatccctttg 

caacattttg 

ggtcaacata 

ctgtagtctc 

gaagctggag 

ctcaaaaaaa 

gtgtgaacat 

tccatgaggc 

caacccacat 

gtgttcattc 

agcacatgtg 

tattccttct 

ttccagctct 

gcctcccagg 

cacaccacca 

tctcactctg 

cctcccgggt 

gcaccaccat 

gacaggctgg 

gagattacag 

ggtttcacca 

tggccaccca 

ttctaatgta 

tctaggctgg 

aagtccccac 

gatgtacatc 

tatcacaggg 

gaatggtggg 

ggccaaggac 

ttggttttgt 

tggagcatat 

cacattttgc 

atcacttcgg 

accaacatgg 

atgcctgtaa 

cagaggttgc 

agccatcatc 

cataagtagg 

gcccgttggg 

gtgcagcaaa 

ggatcctgtt 

aataattccc 

ccaacacttt 

tgggcaacat 

aggagtggtg 

ttaagcctgg 

gcaacagagc 

gaaagacacc 

tacccccaaa 

ataacagcat 

taatggacaa 

gtgtgatttt 

tgaaaataaa 

ctattataaa 

ggccaaggtg 

aaccccgtct 

ccagctactc 

gagccaagat 

aaaaaaaaaa 

agtttggcag 

tcctaaatat 

aagtatttat 

cagctgatga 



gctctacaca 

agaatacaaa 

ggaggctgag 

gtgagacccc 

agttactggg 

tgagccgaca 

aaaaaaaaaa 

ggacacacac 

tctagtggtc 

ttccatggag 

actgctgtgt 

cagtgaagag 

tgtgggaatg 

gttgcccgag 

ttcaagcaat 

cctgtgagct 

ttgcccaggc 

tcaagtgatt 

gcctggctaa 

tctcgaactc 

gtgtgagcca 

tgttggccag 

aagtgttggg 

ctcatccttc 

attcagggcc 

actgatctag 

ttgtgggggc 

cctggcgcct* 

aaaggaagct 

tttcagtctt 

ttgtttttaa 

attggaaagg 

agccataaaa 

gaggttgagg 

agaaacccca 

tcccagctac 

ggtgagccga 

ctcagcaaac 

agctgaacat 

gagtgggggt 

ccaccatgac 

ttgttttaag 

atttaaaaca 

gggaggccaa 

ggcaaaaccc 

gtgcatgcct 

gaggcggagg 

aagattttgt 

tgagatatat 

actaattcct 

tatcacattg 

gagcactgag 

gtccaaatca 

agtatccaat 

tatttaggct 

ggcagatcac 

ctactaaaaa 

aggaggctga 

cgtgccactg 

aatctctgca 

gtcctcaaat 

actacaccca 

agctgtatta 

acggagaaac 



gggggaccta 

agacagtggg 

gcaggaggat 

tatttctaca 

gaggctgaag 

tctcgccact 

gaagaaatat 

actcatcaac 

catggacccc 

ttggactgtc 

atcccagcac 

tcagtcagct 

acagatttcc 

ttggagtaca 

tctcatgcct 

gatatttttt 

tggagtgcag 

ctcctgcctc 

tttttgtatt 

ctgacctcgt 

ctgcactcgg 

gctggtcttg 

attacaggca 

acaactcagc 

ttgtccacat 

aataatcagc 

aggaaccacc 

aataagcttg 

gaaaaggtag 

gcagggtagg 

tgaagggtag 

acagagacct 

aggaatgagg 

caggcggatc 

tctctactaa 

tcaggaggct 

gatcgtgcca 

taacacagga 

tgaaaacaca 

tggggggtaa 

acacgtatac 

aagaaataaa 

ataaaaaata 

cgcgggcaga 

tgtctctaca 

gtcatcccag 

tagcagtgag 

ctctaaataa 

ttccatatct 

tcctaaactc 

gctgtcatgt 

acagggtggg 

ctccatctca 

tcaattcatt 

gggcacagtg 

ctgagtcagg 

tacaaaaatt 

ggcaggagaa 

cactctagcc 

tgaagaatgt 

agttaaacat 

agagaattga 

ttcattacag 

aaaatgtggt 



gagtcaactt 

cgcggtggca 

cacttgagcc 

aaaaattcag 

taggatgatc 

gcactccagc 

gttattgatc 

attcacattt 

catggctgga 

tgctgcatga 

ccagaacagc 

ggtttcacac 

ttcatttctt 

gtggcacgat 

cagcctccca 

tcttttcttt 

tggcgtgatc 

agcctcccaa 

ttttagtaga 

gatccgccca 

ccattttttg 

aactcttggc 

tgaaccactg 

tcaagtttca 

acccaccaaa 

ttagttttct 

acgtggtagg 

cggccaacat 

gctaaaatca 

taacaggaaa 

cgttatcgtc 

taaagcagtt 

ccaggcatgg 

acctgaggtc 

aaatacaaaa 

gaggcaggag 

ttgcattgca 

acagaaaacc 

tggacacaga 

ggggagggaa 

atatgtgaca 

gaaaaaacca 

ggccaggcat 

tctcttgagc 

aaaaatataa 

ctactcaggt 

ctgagatcgt 

ataaataaaa 

gaatcaatag 

tgttctcctg 

gggctcctgg 

tctaaatcct 

tgagcctcac 

atttaatgaa 

gctcacgcct 

agtttgagac 

agctgggtgt 

cgcttgaacc 

tgagcaacag 

acataaaatg 

agagttacca 

gaatatttgt 

ctaaaaagtg 

atacccatac 



81600 

81660 

81720 

81780 

81840 

81900 

81960 

82020 

82080 

82140 

82200 

82260 

B2320 

82380 

82440 

82500 

82560 

82620 

82680 

82740 

82800 

82860 

82920 

82980 

83040 

83100 

83160 

83220 

83280 

83340 

83400 

83460 

83520 

83580 

83640 

83700 

83760 

83820 

83880 

83940 

84000 

84060 

84120 

B4180 

84240 

84300 

84360 

84420 

84480 

84540 

84600 

84660 

84720 

84780 

84840 

84900 

84960 

85020 

85080 

85140 

85200 

85260 

85320 

85380 
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aatgtcatat tatttggcca taaaaaggaa 
cttgataatg ttattctaag tgaaagaaac 
gcatttatat gaagtgccca gaataggcaa 
tgccagagac tggagggagg agataatggg 
tttcttcttg gggtaatgaa aatgttgtac 
tgtgaatata caacatgctg aattttatac 
gtctcgctct gtcacccagg ctggagtgca 
ctgcctccca ggctcaagca atcctcctgc 
atgtgctacc atgcctggac aatttttgca 
gcccaggctg attttgaact cctggactca 
gctaggatta caggtgtaag ccaccactcc 
attgtatggt gtgcgaatta tatctcaatt 
cctttgcgtg tgtcaggcac tagggattcg 
gagaacacag agcccagcag gagagagtca 
gaagcgccat ggaagaaaga catggtgcca 
atctagactg gggctcaggg aggaatcttt 
tgtcagaatc agtagaattt atcaatcctc 
acaacaggag gatgcggaga cattccgaga 
ataggatgca gagcctgtgg tgtgaggttg 
ggatcttctg agacaataaa gagtatgcaa 
cctccctaac ttgcttgctt ccactgttgc 
ccagtggctt aagacacgaa actcaagtca 
catccagcct atcagcgagc ttgtgggtct 
ctaccatccc ccaagcccag atcaccatca 
accgggtctt cccaatccta cccccgcaac 
atcgagatgt agatttgatc aggccacatc 
aatgcacaaa ctcccaatga tggcccctga 
ctctgtgtct ttgccatggt aacctccttc 
cctcaagttc ttcctgggct ggaacattct 
atcttttgtt tgtttgtttg tttgtttgtt 
ggctggagtg caatggtgca atctatctcg 
agcaattctt ctgccttagc gtcctgagta 
cggctaattt ttgtattttc agtggaggtg 
gaactcctgg cctcaaatga ccaccctcct 
tgagccactg tgcccaggca acacttcaga 
cctcttgtag taactagcct gactccagca 
catctgagtt tccctttgat aaaactcatc 
caagggcctt tgcccacctt cctcaccact 
acccaccatg cctcctgcct gacaaattcc 
accttctggg aggcagtctt ccctgagcac 
cctgctgaac atctcttttt tgaggggcgg 
ggctgacctc aaactcctgg cctcaagcga 
gattacaggc atgagccact gtacctgacc 
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 
nnnnnnnnnn nnnnnnnnnn nnncctgggc 
aaaaaaaaaa aaaaaaaaaa aaattatttt 
tagtccaagc tacttgggag gctgaaggga 
tgcagtgagc cgtgatcgta ccactgtact 
cctaaaaata aagaagaaaa tatggcaatt 
aaggatgaga aaggaataag gaaggacaga 
ttagcaattg tattgttatc aagtaacatc 
aatactatat atgtgtatgt acatttacat 
ggaagagaaa gggctacaaa tgtaaaatca 
tattaagtat ttatgatatt tttcttataa 
aacccaaaag cagtgacaac . ccaggagcaa 
ctaccatttc caattaaaga aacccagggc 
gggcaagaaa cacacctagt gaaatggacc 
aactctctag aaataatgtg atttcatcta 
tcaatggcca aaggtgagac aatttgagca 
acatgtcaaa tatattttaa aatgaaatat 
aaggtcacta gggcaccaac tcatatttca 
ttatttctgg gtaacaaaat agtaaggaat 
aaaagtagaa gatagaaata gaaaatcatc 
agaactttgg gaggccaagg caggtggatc 



gtactgatac atgctacaac atggatgaac 85440 

cagacacaaa agaccacata ttgtatgact 85500 

atccacagag acagaaagta gattagtggt 85560 

aaatgtggaa tgactgctaa tggtatgggg 85620 

aattagataa tggtgatcat tgtaaaactt 85680 

tttattatat tttatttttt ttgagacaag 85740 

gtggcacgat ctcagctcac tgcaatctct 85800 

ctcagcctcc tgagtacctg acactacagc 85860 

tttttagtag agacagggjbt tcgctatgtt 85920 

agtgctccgc ccacctcagc ctcccaaagt 85980 

cggcctaaat tgtattcttt aaaagactga 86040 

taaaaaaaac aaaacaaaac aaaaaaaaaa 86100 

atgctgaata agacacagac cctaccctca 86160 

cagatgaatc aagtgttaca tcatctatag 86220 

tgagaacata cgcttagaga agggaatttc 86280 

cagggtgatg cttgtgctca gagttttcca 86340 

cagaggagga aacagcaaat gaaaaatctt 86400 

gctgatcaag ggctggtgtg aacaaagcac 86460 

cagctggaaa ggtaaaacac taattacatt 86520 

taatctcaaa cgaccgaaac tgaccttcct 86580 

ccgtatcata aaagcaccac cctcttctac 86640 

tcccaggctt tctccccacc tcactctcca 86700 

taccacgtaa agacttctca tctccagcta 86760 

gctcaggcct ggactcctgc aacctttcta 86820 

atgaccccaa tagcccatca gaatggacta 86880 

ccttgaaagg cttcctgtga ccctcgggga 86940 

gtcctgtgcc- atctgggtct gccctctgcc 87000 

acacccatta atactccatg ctctctccta 87060 

ctgcactagc ctagccaact aaccctttag 87120 

tgtttttgag acagtcttgc tctgttgcca 87180 

gctcactgca acctctgcct gccgggttca 87240 

gctgagacta taggcaccta ccatcacgcc 87300 

ggttttcacc atgttggcca ggctggtctc 87360 

cggcctccta aagtgctggg attacaagca 87420 

tcttaatgat catttccttt aagtgcctga 87480 

atgaatcctt ttgcaatgta acctatataa 87540 

atatatttgt tcctctgaca gttcagaggg 87600 

atcctctcac cacttaacac agaactcacc 87660 

taaccatcct tcaaatctca ctcacctatt 87720 

caagacaatg ggacacattc ctttatacac 87780 

gtagagatga gtgtctcact atgctgccca 87840 

tcctcctgcc ttggcctccc aaaatgctgg 87900 

gcaactgggt tagnnnnnnn nnnnnnnnnn 87960 

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 88020 

aacaaagacc cttcctctac aaaaaaaaaa 88080 

aaattagcca gacatggtag tgcatgcctg 88140 

gaggatcact tgagcccagg aggttgaggc 88200 

ccagcttggg caacagagtg agacctcatc 88260 

tgactgtaca tctctaatgg gatatatcct 88320 

aaaaaggaaa caaagaagta gcaacagtat 88380 

aatattggta aaaccagtaa ttatatttaa 88440 

atgcatatgt taggaaccaa gtttatcaga 88500 

aggaaataaa aatttgaata aaaatatcag 88560 

aaaaattata tatatgttaa ctctatccaa 88620 

taaaaaacct cagcatccag actgtagtct 88680 

tagttgggaa aaatgacaat ttcatgtcta 88740 

tgaacattta attgtgttag aaagtaagga 88800 

aaagacacag attctgggct ggtaaagttt 88860 

tcaagaagaa tcatgacaga acagattaaa 88920 

tataaaagaa acaattagta gccatccctg 88980 

aactggtaaa taaatgtgta agccaagcat 89040 

gtttttcttt ctagaagaat tctagtgatt 89100 

cttttggcca ggggcagtgg ctgtaatccc 89160 

actggagatc aggagtttga gaccagcctg 89220 



49 



WO 02/29059 



PCTAJS01/31488 



gccaacatgg tgaaacccca tctctactaa aaatacaaaa attagccagg tagtgggtgc 89280 

ctgtaatgcc agctactcgg gatgctgagg caggagaatc gcttgaacct gggaggcgga 89340 

ggtcgcagta ggctgagatt acgccactgc actccagcct aggcaacaca gtgagactct 89400 

gtctcaaaaa aaaaaaaaga aagaaaagaa aatcatcctt ttgcgatcct aatgaaataa 89460 

tgggcctagg cattgatcat taatggctcc taaaatcact aagtatatgg ttgatgggaa 89520 

actttatagt ggatggatca gactcgcaat gtctaaacca gttgatcaat cttaacatcg 89580 

taacaagaca acagacacca ggggctgctg acaggagaac agaggaaacc catagctcta 89640 

ccactgagtt attcacggca aaaaaaaaaa aaaaaaaatt aaactgcgtt tcctccaagc 89700 

ttctaatcct gttgtttaca ggaaataccc aaggaaagga atacttttaa atgacacatt 897 60 

aaaacaacgc caaatccaaa atatggggaa atgacccagt ttcttcaaca aataaacaag 89820 

aaaaggtagg ggggaggact gttctagatt ttaaaagcta tagaagacac agcaaccaaa 89880 

tacactgcat ggaccaggca tggatcctaa ttggaacaaa ccaactgtaa aaggatgtat 89940 

ttgaaatgat tggaggaatt tgaacagtga ctgcacagta gatgatatga agaaattatt 90000 

gttatttttt aggtgtgatc atgattttat ggtgatgttt aagtaaaaga ggccttattt 90060 

gttagagata catgtacggg tatagagaaa tatttacgga tgaaatgata cgatgtctga 90120 

gatttgcttt gaaaactcta gcaggtgtgg gagaagcagg tgcatgggtg ggggaaggga 90180 

tagatgaaat aagtatgcaa aatgttagtc tacttttgtc cctcctgacc cagcaggtta 90240 

aaatacctca gcatacctct actcctccaa ccaggtccaa ggatcaggcc aaaactccct 90300 

gatgtggtaa acagcctgac cccttcttac ctctctctct ccagccactt ccctaagatt 90360 

ccccagtgct ctgtgcccta gccagcccga ctcatctgcc cagattcctc aatgtttcac 90420 

tctctcattc accattttga ccccactgtg ctcctgggcc actctccaag gccccgcctc 904 80 

ttcatctcct ccctccttac tcatccttca ggtcttggct taggtgccat tccctccagg 90540 

aagccttccc tgacaccaat cccatcctca cctagaacag attatgtgcg cttctttgtg 90600 

ccccccatgg ccccctgtgg gtttgcttca cggattataa ctgcctgact acctgccttt 90660 

ctccaccctc tagactgaga acaccttgag aaaaagaaca catctatctt gtctgtcatt 90720 

gaatccctgg tgtctggcac catggctgac acataactat cactcagtga ctagtgtttt 90780 

aatgaatgaa tgagtgcaac tagacagggt taagaacaaa- agagaagacc aggcatggtg 90840 

cttcacgcct gttatcccag cattttggga ggctgaggcg ggcagatcac ccgaggtcag 90900 

gagttcaaga ccagcctgac atggtgaaac cccgtctcta ctaaaaagac aaaaattagt 90960 

gggggatggt ggcacacgcc tgtaatccca gctacttggg aggctgaggc aggagaatct 91020 

cttgaaccca ggaggtgcag gctgcaatga tctgagatca caccactgca ctccagcctg 91080 

ggcaacagag taagactcta tcacaaaaaa aaaaaaaaaa aaaaaaaaaa gagcgagaga 91140 

agatgtcatg gggtaaatga agacctccct tcctggttcc ctgaccagcc cctgccctcc 91200 

cccgcatctc acctgtcttt cttgcttcct tctggtactt ctgtttaagc cggtccatga 91260 

gcaggccatt ccagggggca cacagcactc cgaactgagt gaaggcaaag gcatttgtgt 91320 

aggtgctgac tgcagaagga agagagaggt tggttgatga gaagtttcca aaactccctt 91380 

ccaggcaggg actctcccac cttacccttg tctgcatgtc cctcctcccc acaccatcag 91440 

accctcctct ggtgtgtaca gccctgctgg gaggctctgt gttcccagct gggacatgca 91500 

gatgggctac ctcccagccc taccacatac ctcgtgccat gtccccaccg gccatgttgg 91560 

tcagcaagga gttgagagtg ccaatgaaga ggtagtgcca caactgtatc acagacagcc 91620 

acaccaggtg ccaggcaaag cgccgagaga aagcgtagct ccagaaggag cggagttcct 91680 

gcttctgccc tgcccctggg gtctctaatg gggagaggag gatctgggcg tgaattacga 91740 

ggaaagtgga caggtaggat ggggagtgtg gaggcttcaa tggaacattt cagatccggg 91800 

cccaccttct acccttggct cccagaactc acccggcgtg ctctgcacct gcctcggact 91860 

cacacatccc cacctccctg ctttgtcatg ctggccctac caccttggat gaccctctgt 91920 

gttcttctct attgaaatcc gatctgtctc tcacagcctg gtcaatgaca cttcttgcag 91980 

taataccttc ctgatctttc tcagcgagaa aggtgaaagg aacgacaagg agaggagaaa 92040 

gtcagaaggg agaggagaat gagtgtggat actctgttct aacctgctcc tcagcacctc 92100 

cctttctttt gataccagta tcctgagttt ctttgggaaa tcttcctcta ccctaatctt 92160 

catggtccag atgggaccat gaattcagtg ttctgttcct ctctcaaggt taaccaatga 92220 

gatggttcct cctaacaagg cagaccggcc atgagtttag attaggatgg acttaatcta 92280 

aaataggtcc tacaccctgg caagttcaat gtcctccctt gattttggaa gcttcccaga 92340 

accctattct tctttcttta aaataaataa ataaatacat gttttggatc caattgtcag 92400 

atggtaaaaa taaaaacaaa aaaatcaatt ttattctgta tatttaagat atacaatatg 92460 

aggtcatagg atacatatag ctactaagat ggttactaca gttaagcaaa ttaacatatc 92520 

catcatctca catttctacc tgttttgtga caagagcagt taaaatctac ttgtttagga 92580 

aagtcccaaa cacaatgcag tttgatgacc tacagtcttc gtgctgtgca ttagatctct 92640 

aggcctgttc atcctgctca tctgctcctt tttgtccttc gacctgcatc tcccatctcc 92700 

tctcccaccc cgttcttatt tctactgtag ctagctgcgg tttgtgatgt gtgtaaccaa 92760 

agacgcagaa cagagaggaa ggaaaggaag cagtgataga gttgggacaa taagagaggg 92820 

cggacccagg agacctggag aaatgggggc actgtaccag acttagtgca atggcatcac 92880 

agaagagggc agaaccgagg agtgggggga agggaaggca acccatggca ggcgggcttc 92940 

aaggggtggg gaagtgatag gatgcgaaat agagaaaaga gggacagaaa agagacgaaa 93000 

gccctggacc ctccattaag tgagagggtt gggaagatgc ctaaggccct ttttctgtcc 93060 
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tgcctttcct gattctgggt ccctggggga gctctggagg tgaggggcca ggaaaggcac 93120 

aaggagaggc ttgggtctgg aggagagatg ggttagccag cagggctcac cttccttcgc 93180 

tgaaaggaac tcctttgact gtagctccct gttttcatgc tcagctgttt ccttctcttc 93240 

ctttgtggtg ccattcccag ggcacaggct atggaaacaa aagccccacc agcaaggcca 93300 

aggactgtga gccgaacctg agactcagac tggagggaat agcatggtga atcccacatt 93360 

ccaccgcact ttggaatcac cttttagcca ctctgatgcc caggttgcag accagaccag 93420 

ttaaatcaga atgtctggag gtgagagcca ggcttccttt tctaagatct ctatgtgaat 93480 

ctagtgattc taataagcag caaagtttag gaagcatgaa aagagtaggg caggcccagg 93540 

ttcaaatccc agctctgcct cttcctagca acagaaagat ggctcagact taacccttct 93600 

gagcctcatt ttttgcattt agaaaatgga gataaggata tctcagagga ttattgtgag 93660 

gatgaaatca gagagcacat ggggtctgac aattagtaag tgagcagcaa aggaatgccc 93720 

ttcctctact ccttgtggca aatgactgca aaaatgatca catttcttca cctcctctgt 93780 

atttccccca atttgaatga gactgcagct ctatttcccc atgccctgaa tctgggccag 93840 

ccttgtgaac tgcttcagcc aaaagaatgc agcagaagtg gctgtgccaa ttccaagctt 93900 

aaatctcaag aacgcttgtg catttctgca ctctttcaga accctgaaat cacggtgtga 93960 

atgagcccac gctggcttgc tggaggatga cagccacgtg acccaggcat ccctgtcact 94020 

ccaaacctat gtgagtgagg ccatcctagc atagccagcc cccatgtaat cctccaaatg 94080 

atcagatgta tgaatgagcc ctgtcaaaat catctacatc tggccctgat cagcggaact 94140 

agccagctac ccacagactt gtgaaaaata ataaatgctt aacattttag gctgctgagt 94200 

tttgagatag tttgttatgc agcaatagct aacagatgca ctgctccagt cctcctcctc 94260 

tcctgtgata ggtttgcttt accctgtcca tcccacccta gggccaatga ggggctctgg 94320 

cccacaatca ccagatagtc cttacccata gctgtagttg gggggcagtg ggtatgggat 94380 

gtgcccccgg ggcatcagga ggaaagtgcg tgctacatgc caggtactgc agacagagat 94440 

gaagatgaag gaggccctga ggctgatgcc tttttcataa agaagctgca gaaggagaag 94500 

gaaaaagtca gtgtcacacc cacgttcata gcagcactat tcacaatagc caaaggatgg 94560 

aagcaaacta agggtccatc agcagatgaa cagctaaaca taatgtgatc tatacacaca 94 620 

atggaatatt attcagcctt aaaaaaggaa agaggcaacc- atgctggctc acacctacaa 94680 

tcccagcact ttgggacgca cgaggatcac ctgagcccag ttcaagacca gccttgacaa 94740 

catagtgaga ccctcacccc ttctctagaa aatttttatt taattagctg ggtgtggtgg 94800 

catacacctg tagtgccagc tactcaggag gctgagtggg aggatttctt gagcccagaa 94860 

gtttgaggct gcagtgagtc atgactgggc cactgcaccc cagcctggac aatgaaacat 94920 

gaccttgcct ccaaataaaa aaaaaaaagg aaaggaaaga aattctgaca catgctgcaa 94980 

catggatgaa ctttaagagc actatgcagg gccaagctca gtggttcctg cctgtaattc 95040 

tagtgcttta gaagaccaag acaggaggat tgcttgagtc caggagcttg agaccagcct 95100 

gggaaacagc aagacctcat ctctactaaa aataaataaa taaatcagct gggcgtgatg 95160 

gtgcacgcct gtaattccag ctacttggga ggctgaggtg agaggatatg attacatgat 95220 

tacatgcctg taatcccagt actttgggag gctgaggcaa gcagatcacc tgaggccagg 95280 

agttccagac cagcctggcc aacatggtga aaccccgtct ctactgaaaa tacaaaaatt 95340 

agtggggcat ggtggcacgc acctgtaatc ccagctactc gggtggttaa ggcaggagaa 95400 

tcgcttgaac ccgggaggcg gaggttgcag tgagccaaga tcctgccacc gcactccagc 95460 

ctgggcaaca gagcgagact ctgtctcaaa aaaaaaaaaa ggttaagata gtaaatttta 95520 

tgttatgtat attttattgc atacaaaaac atcagcagaa gaggcagggg ctggaaccct 95580 

gttttctaag gagtcctagt acaagccatc acctactatc ctgtaagctg attagggaca 95640 

cctggtacac acatgccccc acccacccca agacacaccc ggcagtagag gagtcctcat 95700 

acgacccatc cccacagccg gtggagcctc ctcgtgtggc tccccagaga tcttctagcc 95760 

cagtgccttt tttcccccaa cgacagcaaa ggccttttgt tcaaagaaaa ttttacacaa 95820 

aaattcatct tacaaaacac accaatgggg agcttgccag tcatctccct ctttattctc 95880 

cttggtgact ggtatgacat caaagagaat ccctaagttc ctcaacagct cagtttgaaa 95940 

accaccgacc tagcccaacc tcctcccatt ttacagagag tgacgttgag gtccagagag 96000 

gtgcagtgaa ttgctcaata aattgacaga gtaagcagca gcaaagtcag attaaactaa 96060 

gaattcctgt tcctgctccc tttccccttc caactctaga gagacaggag agaggctggg 96120 

catggtggct catgcctgta atttcagcac tttgggaggt caaggaaggc ggattacttg 96180 

aggtcaagag ttcaagacca gcctggccaa catggcgaaa ccccatctct actaaaaata 96240 

caaaaataag ctgactgtgg tggcacgcct atagtcccag ctactcagga ggctgaggca 96300 

ggagaattgc ttaaacccac taggcagaga ttgcagtgag ccaagatccc accaatgcac 96360 

tccagcctgg gagacaaagt gagactccaa ctcaataata aaaaaaaaaa aaaaagagag 96420 

aggaaagaaa gatgaggcag ccatctgggt tctccagggg aaggagggag aacccagaaa 96480 

gtgactctta tgccaggagt agaaaggctt gagtgcctca ggggctcagt ctctgcataa 96540 

ccctccaaac ctccaaagct tatgggacta agctagactc atgtctgggt ggtgactgcc 96600 

agagatcctc ttctctgccc ccataacctg caggcagtgc caactgcctg tgacctaaca 96660 

ctaagcccag agagaagtcc caggttggat ggcttgagat ccacactctt cccttccttt 96720 - 

cactcagcca tctgtggtgt gctggcttta gtcctccagc ttgctgcctc ataattgaag 96780 

catggttgcc acaactccag ctatcacatc ctcacaccac aacattcaat gaggaagact 96840 

ttgtttttac tctgctttca ccttgcgtca gggaagaaaa gtccccttga atcttccact 96900 
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atatacactc cctttatctc attaaaaagg actggatcat atgctgacct ccacctatca 96960 

ctagcaacgg gtaaatggat tgccatggtt ggctttaatc aatcaggatt catcccctgg 97020 

gctaagcggg tcactgccca gataaaactg ttcgcaatga ataagacaga atggttgttg 97080 

attgacctct aatagccttg gcaacagttc atcccctgat accccaacat cagccactgg 97140 

gacagctgga caagcctctg tgtctgcccc tgctgtaccc actagccact tgccaccttc 97200 

ttgtccaaac tagaagctca cagcagcaaa cgccccactc taaaggtccc ccagcctcta 97260 

cccaacactg gcccaagcac attatgacca ctgccacaaa agcttgggca agtctgaaga 97320 

aggggcttag cggttacaag ctcaggctct agaaccgaca agcctgggtt caagttccag 97380 

tatcatggct actagctgca gaaccttcaa caagcttttt aacctcagag actcaaatgc 97440 

ttcatctgta aaatgggggt aacacagtac ctacctcacc gagttgatgg agacaaataa 97500 

tgcaggttca caagacaagt gtctggcata tacaagtgcc cagtgaatgt aggctgttgc 97560 

tatatttacc ttaataataa ggaagactgc cgaggaagag tcaaatgctc cattgtacag 97620 

agtgatgatg gtcgaacggt gttggccaaa taggttccca atctggggat gataggacta 97680 

gcctggatca cttatttatt catgaaacag atacttcctg agcacccagc atgtggcaga 97740 

ccctccttat acccaaactc accctccacc gctagagctc ccacctcagc ttgggccaac 97800 

cccatctgag gcagccaatt atagaaaagg gtctctcctt ccctccacct tcccgccacc 97860 

ctgccgagtg cctgggatta gggaaggctc ccacctgcag gttggtgatg agaaacagga 97920 

ttcccccaat ggtgagcatt ggcatggcca ggaagagcag cacggctgag cctggaccat. 97980 

caaagtcaga ggtagggtgg tgtcatagtg caacccaaac atggagcccc aaactctgct 98040 

cccacctgct ccaaattccc aacaatcctg gtatccaggc cccaattcta gccagcgttc 98100 

cagcgtcctt caagggtttt taggataccg gccaaggctt ccccagatat ctctgtggaa 98160 

gtcttctgag ccaccttctt cccaaccaaa gttggtcctc agtctgtggc aggccaggaa 98220 

gtctacagac agaggcagag ctctaagtga agccacctct ctcttccctc agtaaaccac 98280 

aagctgcctc tccctttcat ccttgacact cctggaaaag aagaccctgg actcaggtcc 98340 

ctggctcaac cctctagccc attccctaat tcatggtatt ggccttgagc ttcaatcatc 98400 

tgtttaatgg gaacaacagt tcctgctctt cctgtctcag gtgctatgag aactgagtga 98460 

gaaaaggacc atggtctttt ctttgttcac taaactctga* gcacttcttt ggtgccaggc 98520 

attgtgcttg gcactggaaa tgcaagatga atcagatagt ccttgccctt aaatagactg 98580 

acatgcaaac aaatggttat aacaggtctg gtaagtgtga gaccacagca aaaaagctca 98640 

agagctgggc taggggaacc cttgacaaat tcttcctccc caaaccagac ttctgcccac 98700 

cattattctg gccacaacct atgcctgtcc tattatttgc taaaatgttt taagttgact 98760 

cacttttatc caaaaagtat ctatttttaa aggacacttt atatcactac tgtagatgaa 98820 

aacactggca ttacttgtca tgaatagaaa gtaactgtca aaataaatac aatgaaagga 98880 

aaacaatgtt attcaattgt agctggatgc atttgacctt agaatgttca aagcctaaga 98940 

cctgctcttc ccatcagtgt taaaatcaca ctggccccac atgaagacat tctttcatga 99000 

aatcagaagg actgaaagag aaataaaaag ggaatagctg ttctaccagg tgatttgatg 99060 

tttgttagtg tagttcacgt agtatgcgtg tgcccctaac atcctcttaa ctaccgtgct 99120 

ataccttaag aagcactgcc aagagctaat tttagagtat tcacacagtt taccattcaa 99180 

tttctgtctt tataaaatgt acatctctcc tactactaaa ggttggagac tcctttcaca 99240 

atagagtcct tatgggctca atgctttttt caaaactgaa aagccctata ttatggagga 99300 

agaggaggat tgttgctcag acgatttgca ggcacgagtc aaacattacc cagccaccac 99360 

ctccacattc agttgcttaa aaatcattta caggctttta gagtagatga tgctggtttg 99420 

ataaggagag tggtttgaaa taattggttt gaggtgctgg gccatctcat gagatctgtg 99480 

tgaacaaaga cactcagcct ctgtgtttgc ccagcatgag tgcagacaat ctcatgatgc 99540 

tgtcagcttt agcatagctt acacacacaa gagtaatgta ctttctttcc taaaccaaaa 99600 

attgagccac gggtctaaca ctaggaagga atattgggag gcatctcgtg gccaccataa 99660 

ccaaggcaat gacagaaaga agagtgaggg atcaggaggc ctgcacatca ggcccacctc 99720 

ccacttgctt tctctgtggc catggacatg tctttgcaag gggtcctgct gtggcttcag 99780 

tttctcctct gtgtaatggg tggaagggtg gtggaaaata aaccagattg gagttccaga 99840 

cttaacagac tggtgaaatt ttaaaacaaa gattttgagt acaatagggt tgtcaacttt 99900 

taccctgcta agtaaggata tttgcaaaat ggtcattcat ataatcattt cattaaaaag 99960 

agaaagagaa cattttaaca cataggagaa ggatgtaaag gttttttgtt gttgttttgt 100020 

tttttagggt tttttgtttg ttttttggta gagtctcact ttgtcaccca ggctggagtg 100080 

caatggtgtg atcttggctc actgcaacct ctgcctcctg ggttcaggca attctcctgc 100140 

ctcagcctcc tgagtggctg ggattacagg cgcgcaccac catccagcta atttttgtat 100200 

tttttttagt agagaaggga tttcaccatg ttggccaggc tggtcttgaa ctcctgacct 100260 

cagatgatcc acccgcctca gcctcccaaa gtgctgggat tacaggagtg agccactgca 100320 

cctggccaga tgtaaagttt tgaataaatt ctactctctg aagtaatccc tctccatcat 100380 

ccttgctttt cacattttct caataaactg ttttcacaga ccagcaatag ctcaagatcc 100440 

ttccaggatt ctttcaagct gcagatctct gaataaccat gtggtctgta tatcttgcct 100500 

atagccctct gctcacacct gccccagcca ccaggtgcct ctgagcttgc atccctccca 100560 

cccacctgac agcactcacc tgcagaggtg aaggctatga tgagtgtggc ggtggtgtag 100620 

aaaaatctga aacacataac aggaaaagca gaatattgtc aaggagggag aaacctggga 100680 

gaaaaaacat gattctgctc agccagccca caagtgtagg acttgaccgc accctcagcc 100740 
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tgggatgcaa 
gcgatgagat 
ccaaggctag 
acttagccag 
aaccccttgg 
ctcctcactc 
tataaaaaga 
ccaaggttat 
aaagcaatat 
gtagcaggga 
aaattgtaag 
cccgtctctc 
ctgtatcagc 
aggatgcagc 
ggaaaccccc 
ccaaagaaac 
gggggcggtg 
gatgaggaca 
ttgtaaccct 
agcacgtgat 
ggggctgtgg 
agctgctggg 
ggatttcact 
ctggcctttg 
atgtagccag 
gagaacctct 
agccttcctg 
tcgcaaactt 
ctggtgcctc 
tctcatcata 
tgggagccag 
gcattgccaa 
aagacaaaca 
agcccagtca 
ggagtggatc 
gcgtttgagc 
tggttgtttc 
aagatagagg 
tgtgaggtgc 
tactccaata 
ggtcttgatg 
gagcaggtgg 
gacagcaact 
ctgccaagtt 
aggaacttgc 
aaataccccc 
tcctcgcatc 
cccgctccag 
ggcgccccat 
tctccgcgcc 
tcccctgctc 
tctccatcac 
ccctcttgaa 
cctcgctgcc 
aatgctgtta 
cacagacagc 
cgctgggttg 
gtttcacact 
aggactgagg 
aaacccctta 
gccagactgc 
attgctcatc 
tccagcccag 
aagctcagtt 



cgggcactga 
tgagatgtgg 
catggaaagt 
gacagaatgg 
agaaaggggt 
tgtgaccttg 
aatgatcgga 
caggctccct 
cctaaggctg 
gccaaggacc 
ggactggctt 
tctctgacag 
ccccagcagg 
aacacactgt 
ccagtctccc 
ttcctattag 
aggaggatga 
aggatccttg 
ggctccaagc 
agggaagcca 
tgaagtgggc 
agcgagctgc 
ccaagctagg 
ggacttacat 
tggggaatgt 
catcctgggc 
ccaaatgagc 
gtcccttaag 
cctcccaggc 
gaaagagggg 
acttaacttc 
tcggcccagc 
ctagtgaagg 
gcagtgtggc 
ttcaaatccc 
acttggaaaa 
aggcctgggc 
ctgctggaag 
attaagaggc 
catcatctta 
gccacctgga 
agaagcggag 
caaccaggtg 
ggtgttatct 
ccaaggtcac 
aactcccatg 
aacttctacc 
cctgcggcct 
ggcgcccgaa 
ctttccgcac 
gcctccgccc 
ttccttcttc 
acttctccct 
tttcagcctc 
ctcgctcagg 
acatgagggc 
gcagggaaca 
cccagagctg 
gcttttcaga 
gatgactccc 
attgattaac 
ggagcttgtg 
agcacgtgag 
^gggagagc 



tgcctctgag 
tcctgacctc 
cttggacaat 
gcaaagcgaa 
tagtgcctgg 
ggcaaggcgc 
ctgatctagc 
tagcatttgg 
gagaagaggg 
aagaaggact 
tactctggct 
ccaatctcac 
ccgctgatta 
gtgtgagatc 
ctccagtcag 
ctaagcccct 
ggatgaagcc 
gggtgaagga 
cagcccttac 
catttatgca 
aagtctagag 
aggtgtggtg 
ctgctgcctg 
ggctatgagg 
catgaagttg 
tttgcagtct 
acacaggctg 
ctgagttgaa 
acttcccaga 
tgggcagggg 
cttagaaaag 
atctggtcca 
ccagccaaag 
cacgtgcagg 
actttgtcct 
ttcctctggc 
aaaaaccatc 
aggaggcctg 
aggatgatca 
ggcactgcat 
gtcacaggtg 
ctcctgcgct 
cagaggcagg 
tcagtttact 
agggctagtt 
accctcgcct 
aggaaagcct 
gccctccccg 
aggggaccct 
gggccaggtt 
cctcctgccc 
ttttcccttg 
ctagaacccc 
ccagccccct 
cccacctagg 
atagggacac 
tatttttcct 
ccaaaaaaat 
aaccatcccc 
tgcctcaccc 
cattcattgc 
atcagcgtga 
ctgacgtctt 
ttcttgcgta 



ccccaggctc 

atggacagtg 

gagctcagct 

ggtcctccca 

tctgcaaatc 

tgtccttctc 

tcagagtgct 

gggtcttgta 

aggggacaaa 

gaggtacagt 

gtttccggaa 

atgtgcctcc 

ccactgagcc 

acgtcccgaa 

aagggacctg 

agggagtgat 

tgggcaacct 

agagaagagc 

aggaagtcac 

gagcagggaa 

agagtcccgc 

cctggcaggg 

aaggattcct 

cgtgccacgg 

ttcatgaagg 

ggagtagaaa 

ggctcccctc 

atgtggctgc 

tcaagtgggg' 

gcagagtcct 

tcatccctgc 

cacagatcct 

aggacgccag 

ggcaggccct 

cctggacgga 

aagccaagcc 

agcgggtgat 

cgggaaaggg 

tggccgctgg 

ttgagtaacc 

agataaacga 

cgaattctga 

gagatttcat 

gacaatgaaa 

tagtcacgac 

aggactcgca 

ccgggggccg 

cagaggagcc 

tcgccctacc 

cgcattcgcg 

gcccggaagg 

ctcacagcct 

ctagaacccc 

tgcctctgcc 

gaaaatgtca 

acacactcta 

atttgctcac 

cccaaccaca 

tggaggactg 

ccgcccccca 

cccatttttt 

ggccttacta 

ctttggcctg 

ttagatgcag 



aaaaccaggc 
cattttgctc 
gacgatgtga 
cctggaagcc 
aaggccttga 
tgggcctcag 
atgatttcag 
aggcatggag 
ggaaggggag 
cattctgcat 
aggcaggccc 
ctgggagcac 
tggccacaga 
ccttctgact 
aaattccacc 
tggctgttgg 
ggatgtgagg 
aattttaggt 
cctggcctcc 
cgaggtaagg 
tgcctggggc 
tggccgggct 
cttacccacc 
tggtcttgaa 
accccagggt 
aaaggtctcc 
cacctcagac 
ccctaattac 
tgagagctgc 
tcctgctcct 
ccttaccagc 
taaagtaatc 
caaagcccag 
ggcccgccat 
tcacaggcgc 
cttcctttcc 
tctctggatc 
aaaggtagac 
cagcaaatgt 
acgtggcaag 
cttacccaat 
ataccgtccc 
acagaagaca 
caaaagctcc 
gcaggccatt 
aacctggtcc 
ctccccgcca 
cgaggggcca 
cgcctgctcc 
cctctcgcag 
ggctggggca 
cccgcgccct 
agcggtgtct 
tcccctaacc 
cacccagcac 
tttgtgcatt 
cagcttaacc 
gaatcaggaa 
ccccatattt 
ggttctgaaa 
attaatcaaa 
agcagctgcc 
tgtggccgtt 
tctgcagact 



gcaagaagcc 
attctgaggc 
ttggcttggg 
ccaacagccc 
gttctaactc 
gagccttttc 
gactacagtc 
taaaaaaaaa 
gaaaggggag 
ccaaaggctt 
agccagccct 
ctgctctgag 
gcacgagatt 
catctgcaca 
agtggcaata 
ggcggggagg 
ctgtgcaggg 
tttgctaatt 
ggctcaattc 
aaatggaagt 
tgttcctaac 
gtctgactct 
tttgcctggg 
ccggtcaaag 
gaagatgagt 
catgcatccc 
agcttgtcgg 
ccctcaggag 
tgacccttcc 
tgccaccacg 
ctgccctgtg 
ttcattcttg 
gcattccagc 
gagcagaagt 
cgtaagcctg 
cgtagctctc 
ctgtagaata 
tagagttatt 
ggggaataaa 
tagagaggca 
agctccccgg 
ctaaaataat 
caaactcccg 
catggatttc 
ctactgccag 
ccgccgccct 
gcctccgcac 
ggccgcgctc 
gcgccggggc 
cccctcccag 
gacctcccac 
ttttacctct 
ttccctccct 
aagttagttg 
ccagaggaca 
ttgccttgac 
gtctctccca 
gccaagaacc 
tcactcccaa 
gagccttccc 
gacatatata 
ttactatcct 
tccttgccaa 
cccaacccca 



100800 
100860 
100920 
100980 
101040 
101100 
101160 
101220 
101280 
101340 
101400 
101460 
101520 
101580 
101640 
101700 
101760 
101820 
101880 
101940 
102000 
102060 
102120 
102180 
102240 
102300 
102360 
102420 
102480 
102540 
102600 
102660 
102720 
102780 
102840 
102900 
102960 
103020 
103080 
103140 
103200 
103260 
103320 
103380 
103440 
103500 
103560 
103620 
103680 
103740 
103800 
103860 
103920 
103980 
104040 
104100 
104160 
104220 
104280 
104340 
104400 
104460 
104520 
104580 
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gctacctgga 
tttaagagga 
caccaccaaa 
gaagggtggc 
tctttggctg 
aaaagaagta 
gtagggtagt 
ggaggaattt 
tctgaaacgt 
gtcttttctt 
acaaacaggg 
ttaatgcatg 
gtgtggaagt 
gttaatctgc 
tgaagcacgg 
cttgactctt 
cactaacaca 
cccatgttct 
tcctgcgtgg 
ggtctgacgg 
acaccttttt 
tgtgccaagt 
ctcttgtggg 
aggcagatgc 
ggaaccagct 
tctaagtgag 
cccttaatat 
cacagaggca 
gttcccactg 
tggtaaggac 
agggagcata 
tttccttccc 
gacagagtgg 
tataagtgag 
ctcaagagca 
tcctctgaac 
tcctaccctc 
ataaagggct 
tcacaatggg 
ttctacagcc 
cagagaaaat 
agccaattac 
tatggattaa 
tactgttgtt 
tagggagaat 
aagtaatggg 
tagatgacac 
gcttggattg 
gatggagtct 
gagggtgggg 
tttctcttta 
cactctggcc 
cagcctggat 
actggaagca 
cttatgaggc 
caactgttta 
cactgacaat 
ggaggcacgt 
tgagcactga 
cgataggtgt 
aactccccca 
ctgaaactaa 
tcaccaaggc 
tcaagtgatt 



tcccctgagg 

agaaatagag 

gcacaaaaga 

tggggtcctg 

ccccacctgt 

taagatgggt 

agattttcta 

caaaggaact 

atgctctcat 

aaaataacat 

cttaagatgc 

ccctatcttc 

agcaagggag 

aggagcctgc 

gacacggggc 

agagcctcca 

agtcttactg 

ttggttcgaa 

ctccaatgag 

tgcacccgat 

ttttttaatt 

gccgggattc 

agagacagac 

tagcacagca 

agagcacagg 

ggcattaaga 

ctctcctatc 

tcatctgctt 

gatggcttca 

cagtccaggg 

gaactggatc 

atcatcagca 

gaagatggat 

ctgcactcca 

ccaccttcca 

aacctcagcg 

tgacatcact 

ttgatttcgg 

ctcatgtaaa 

tgttttggaa 

tctaatcctc 

tggtatctct 

agataatgta 

aacaaatatt 

ttcaaagtgt 

tgagctcagt 

ccaaatgcac 

tcctgtaagc 

tagagctggc 

aactactaca 

acagaggtga 

cctcagagca 

cagcctccct 

ctgggccagg 

tactgctacc 

ctggtaactg 

atctaaacct 

cctacccctg 

ctatgtatct 

tgttattagc 

acatcaccag 

tgcttttttt 

tggagtgaaa 

ctcctgcctc 



gcccaggaac 
gttacgatag 
taaaagtctg 
taccccaagc 
aagacgagga 
gtatgaggtc 
gtctgcagtt 
tgcccacggt 
caccccgtct 
tatttatggg 
aagattttct 
tgctctggca 
aaacagaaaa 
cagaaagatc 
aggtcgtcca 
ctccacatct 
aactgatggg 
ttccagcagc 
gattaatgag 
gtcttcatcc 
gactgattgg 
gtagagaaga 
aaagaaaccc 
cattgaaagg 
gtggtgaaga 
attcagccca 
tatccgtggc 
ccaagccatc 
cttaactgct 
tctggggatg 
tcctgcatcc 
agtctggata 
atgtttaggt 
tcattcccct 
atgcagtccc 
agggcaattg 
ttctaaattt 
atacgccaaa 
ttaaaatgtt 
gtcagaaagc 
ttgaggcagt 
gagcttccgt 
tgaaaactgc 
attgacttac 
tcctaaccta 
ttaccttgct 
tcagtacaac 
tgtgggaagt 
tcttcagggt 
tagccaagta 
atctgaaagg 
gccagctcag 
ttcaccatgc 
cctaagagaa 
caccctccca 
cacttcccga 
gcaactctaa 
agaagaaaaa 
gattcgaagc 
atctattatc 
gtagcagtgt 
tttttttttt 
tggtgtgatc 
agcctcccga 



tccagctatt 

gggacagcca 

caaccaccct 

tactcactag 

caatagtacc 

cctgcatggc 

atgtagacag 

cattctacaa 

taacaaacat 

agaaaatcca 

attttactgc 

tacattttag 

aagtcaaagt 

tagctcatgg 

gggttctgtc 

cccatcaatg 

acaggaaatt 

tagaaaaggc 

taacatcaga 

ttttctcttc 

ttcaacaaat 

gattcagtgc 

aagatttctg 

cacggaacct 

aagtggaaag 

catcaatcaa 

cactgctcta- 

gccattctcc 

caaaaccctt 

ccagccatga 

tactgcccaa 

tccaggatcc 

tagggaaaga 

ggcacaaaca 

tgcctgtcca 

ccactctctg 

gtatatgtag 

cacataaaca 

tggttttcca 

aaaaggtaaa 

gccaaaaata 

ctcctcatct 

ccaattagta 

tggaaaacaa 

aaaaagaaat 

taagaagtcc 

cccccaagac 

ctgttcatga 

atctcctagt 

aatatgaggc 

atacccaaag 

cttcagtgga 

agaaaacaga 

agcagagccc 

agccccagct 

tagactttgc 

ggactaggct 

taaggaatat 

gctttttgct 

tgggccaact 

caggactggg 

tttttttttt 

tcagttcact 

gtagctggga 



ccaagcccac 
gaactgagga 
agtgacttga 
ttatacaacc 
ttaattatag 
gcaggtgcta 
agccagagaa 
agctgcagta 
ttggacatta 
caaaaatata 
aagacacaaa 
tctcctgggg 
aaagagacag 
gctatctgta 
caccttatct 
tctgcagaag 
agaatatcct 
agatgctatt 
gagagaagtg 
gcctccttcc 
acatgtggta 
ctgctctcaa 
gagtgtggga 
caacaaaaca 
atttgaggcc 
tcatgtccta 
tgcagacact 
tgcaagagtt 
ctgaggtcca 
gacattgctt 
gtaccaatgc 
accctatgga 
gggtttgcca 
atggctagta 
cagacctctc 
ggcagagtcc 
ataaactctg 
aacaaacata 
cctacggtct 
tgcaaacatc 
atacaagcac 
ataaaattgg 
cactattaat 
agaggaaata 
aaattagggg 
caccctagag 
tgtctgggct 
gccacagtag 
gtgttcaaag 
ctccttgctc 
aggcactgga 
tgctagaggc 
gctcccccac 
caagccccac 
ccacttctat 
tagaaaggaa 
ggggaacact 
caataatacc 
taatctgtca 
gaggcctaga 
atagaaacct 
tgagacagca 
gcagcctcca 
ttacaggcgt 



tcctcttttt 
ttttccagct 
ctgaatggag 
tgaggcaagc 
gaattgtcat 
taggcagatt 
gcagctctgg 
ccttcccaac 
gagaaaacaa 
gcatcccagg 
gactctgaaa 
ggatcagtaa 
attttagaat 
catccaggac 
tgttacctct 
acgtggcctc 
ctgaaccatt 
ctgatcactc 
attataataa 
tcatcatctc 
cctcaggctc 
ggggctcatt 
atggtcttcc 
ataacattta 
agcgtcgcca 
ttgatttcac 
catcatctct 
tatttccatg 
gtcaactggc 
tgaggggaag 
tggaggtggt 
tgtttttatg 
aagagggcag 
tcctctagtc 
tcctcaaact 
agatattctc 
agccattcac 
agctttcctt 
tgaaaggggt 
atttcacctg 
actgctatcg 
aatcgagctg 
aaatagcagc 
aagtcacatt 
gaaaaccact 
aactgatctc 
taaggcaggg 
acaggaaggg 
cagttctcag 
tggggagacc 
gggtgggggc 
agcagaggat 
cagaccagat 
cacaccaggg 
gttcatcaag 
tgcctcagtg 
gtgtcaacat 
tgctgggcac 
ctgaatctca 
gggacgaagc 
gatgctctga 
tctcactctg 
cttcccaggt 
gcaccaccat 



104640 
104700 
104760 
104820 
104880 
104940 
105000 
105060 
105120 
105180 
105240 
105300 
105360 
105420 
105480 
105540 
105600 
105660 
105720 
105780 
105840 
105900 
105960 
106020 
106080 
106140 
106200 
106260 
106320 
106380 
106440 
106500 
106560 
106620 
106680 
106740 
106800 
106860 
106920 
106980 
107040 
107100 
107160 
107220 
107280 
107340 
107400 
107460 
107520 
107580 
107640 
107700 
107760 
107820 
107880 
107940 
108000 
108060 
108120 
108180 
108240 
108300 
108360 
108420 
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gcccagctaa tttttttgca ttttttgtag 
gtctcaaact cctgggctca agtgttctgc 
caggcgtgag ccaccatgcc cagccagctg 
cctgggaggc ttcccaagag ctacacagag 
accagttatt actaatagga atgaagatgg 
aacaagagag tagtgctgcc tctctgtgat 
accacaccca cccacacgca tgatcatgaa 
actcctgagt gaaaacaacg gggtttttca 
tgaaaagagc aggaaactgc tggggccgat 
caaagggaag ccggaagaga ctggaacagt 
tagggattag ctggagggag aaggagaagc 
gagagcacat gtaggatggg aaccccagat 
caagtctgtg gtgaaattag ggaaggctcc 
ggaagtggga gacccagggt ccagtcctgg 
gcacctttct ctcttggggc ctccattcct 
tagccctgac ttgctgaaat gctgggactc 
gtcgtggggt gctgcaatgt tcctccagat 
atggtgagtc cgccctggga gcctctgctt 
aaaccacgga agttcccggg gctgatgggt 
tgggctgcaa aaccgcaaga tgggctgtga 
aaagcattct gccctccagg tcttaaagca 
acaaaattgc ctcaggggca tcagggaggc 
ttctcaatga aagcaccctt ttgggtatta 
tgagtgctgc ttttgggaca ggcattgggc 
tgatgcccac agccacccca taaggaggcg 
tgaagaaact gaggcctcaa gagatgaagt 
ggaagagata gacttcccag tatccagagc 
cttttcctgt gccaatgtga tctgcctcca 
acagaagtcc tctctgtgtc ctcttcagcc 
gacatctctg ttatactact ccaatggtgt 
ttgttgtgat taagtatagg ggaaatgacc 
gataaacctg aaatgcaatt tattaattaa 
gaatcctaag gctgactggt cagcacaatc 
gaaatcaata ccagaaggtt ctttgttgag 
ttgacacata ggtgaagctt gacatacctc 
atatccaagg cagggagcca tctgggtgtc 
gatcctatga agatcttacc tttctggctg 
ttccttgttt gcttagatta atttctgcgt 
tgagacccac tgaagaggaa taatcaatac 
tcacttctgt gttgtcactg tatcctcatg 
aaaacatgga cttggaagcc agaccgtctg 
ctctgcaacc tcgggcagat tacctaagtc 
tagcacccac ctcaacatct gtcaagagga 
gaacagtgtc tgccatctgg taagcagtcn 
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 
nnnnnnnnnc taatttttgt atttttggta 
ggtctcaaac tcccgacctc aggtgatcca 
acaggcatga gccaccacgc caggccccac 
cgggaagtgc ctgtggggct ggtgtggaca 
ggattctcca tgacctctgt tactcatatt 
aggacagtct gatggctggg caaaccctgc 
accagagtcc ttccgataca tgattctggg 
ttccagagat atccatgcat gcatcggcat 
cccacaagct tttgacatca atagtagcat 
taacttggta tctctggtac tgctcaatat 
gctcacatct gtaatcccag cattttggga 
gagttcgaaa ccagcctggc catcatggtg 
ttagccaggt gtggtggcag gcgcctgtaa 
aattgcttga acctgggagg cagatgttgc 
gggtaggcaa caaagcgaga ctctgtctca 
ctttattctt tttgatggct gtaggtggat 
tgataatatt taagttatgt cttcagcatc 
ctatccatct gtaaatgtct ttttgaacat 
aatgagaatt tctgggtcaa agaggatatg 



agatggggtt tcgccatgtt ggccagactg 108480 

ctgccttggc cccacaaagt gctaggatta 10854 0 

atgctcttaa tctgtgccct acccagcctt 108600 

catgagttct ggaatcgggt tgatgggggt 108660 

gtaattcttt cagacagcac ccttgattaa 108720 

tctgtgtctc cctgccctgc tcacacagac 108780 

aagaggaaat ggatccagga gaaggagacg 108840 

cattgagagc tttgcccaac accccaaaga 108900 

tgaacactgg acttttgttg tggaaaaagg 108960 

ttccatggtg ctggaggatg gggaagtggg 109020 

tggggtggga ggggaacctc cacttgccag 109080 

gatactcaag gcatggcatt agaccagaag 109140 

actgcggact gtagacagag cactggacaa 109200 

ctctggagcc ccctgggtgg gctgccccgg 109260 

acctctgtga agcgagtgct gaacctctct 109320 

tgtacagagg ctgacattaa gcagggatct 109380 

gctgcacggg agagggcaga aaaggcctat 109440 

ggaagctgaa gtggcctgag agtgactcag 109500 

tcttatagat tgtacatgca gctctcctcg 109560 

ccactctcaa ggaaagagcc ctatctgcaa 109620 

aacacagact caatccttat tccttttaag 109680 

agcaggcctc aaatgtgtgc ctttctagaa 109740 

ataatgacaa cagtaatgac agtcatttac 109800 

taagagctat atgtaatata tattattatt 109860 

gaggtactat cattatgcca actttaaaga 109920 

aacttggcca agtcactcag ccagtaaatg 109980 

ccatgttttc accattatgc tgaagtacct 110040 

ggaatcctgt cttgatgttc ccttccccat 110100 

tgatagtata' tcttttcata ccattctttg 110160 

tccctcccct acccctccct gggagcttag 110220 

cacactaaac aaactcataa gagactgatt 110280 

cactgagaaa tgaaaccacc cagcagatgg 110340 

tctttcagga aggacaggct tttgggaaag 110400 

tacaaagtca gagggaaggg agttgatgga 110460 

tataaagcct ccatcctgcc aaggatcaga 110520 

ctctcctttg gacagtgctg gatttttctg 110580 

catttatcat gattgtggaa ggctttttgt 110640 

atttaataga actgaaaggc aatttcccat 110700 

atactagttg tgttgccctt tgcagagaat 110760 

cttccttata atggagggac agagatggta 110820 

ggtttgaatc ctggctctgt tacttataag 110880 

agtttcccct tctctgaatt ggggatataa 110940 

ttcaatgagg gaatacacat aaagtgctca 111000 

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 111060 

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 111120 

gagacagggt tttgccatgt tggccaagct 111180 

cccgcctcgg cctcccaaag tgctgggatt 111240 

acacctttta aacaaccaga tttcattcat 111300 

tgtgggtgaa ggtggcactg ggagaagtta 111360 

cccacactcc tcaaattagc ctgagtctcg 111420 

ggcaaaccat tccccagccc tgccctctca 111480 

cagctgttgt tacccgtgtc ctccatgttc 111540 

atgtgtataa ttattatatc tatatttcat 111600 

attattaaat tgttctgtac attattttat 111660 

aagaacatat agacctggcc aggcacagtg 111720 

ggctgagatg ggtggatcac ttgaggtcag 111780 

aaacccccat ttctactaaa aatacaaaaa 111840 

tcccagctac ttgggaggct gaggcaggag 111900 

agtaagccga gatcacgcca ctgcactcca 111960 

ataaaaaaaa aaaaggaatg tatagacctt 112020 

gttctaaaat ttgtgtaacc aatctcctat 112080 

atatgaaact tacaaacaag gttgcattga 112140 

ttctagaata attgcaggat aaactcctaa 112200 

cattttacat ttaatagata tttgtcaaat 112260 
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tgtcttccaa 

tttccccaca 

gaagctgttg 

agagaggtta 

cgatagcaga 

gaatggagga 

ccaccctgaa 

agtatttgga 

gaacattata 

atataatgta 

tttggcaggc 

catagcgaga 

atgcatgctt 

gagtttgagg 

aggaagacca 

aaaagcagag 

ctgttactta 

aacatataaa 

gagataatgt 

ggaaattcat 

ctttgatctg 

aggtataggg 

accaggcagg 

cccaaagggg 

tttatcgcgc 

gcttttacaa 

tttaaaatta 

tagcgataca 

tcagctcaaa 

gtaaaattag 

gataatgttt 

gaggatgatc 

caggctggct 

actgcacact 

agggtatatg 

acaagatttt 

ctaggaggac 

gtctcaaaaa 

acgtgctcac 

tcatcacagt 

tttgttttta 

tcttcaagta 

catattctcc 

tgcaaattct 

tttctgcata 

aactgcagtt 

gcctggagct 

ttagtcagtc 

tccagagttt 

agtgctgggt 

cacacggctt 

tgtttactct 

ctgttgaagt 

ttaattgctt 

aactacaccc 

gtcactggcc 

tcccccacta 

ccatttgccc 

agtggcttca 

cagaaatgag 

ggtgcctata 

cagagtctct 

tcacggccac 

ttacccactc 



agtggtcgta 

ccctggagag 

gccgttatat 

gttagtgggt 

ggagggtgac 

cttaaaatat 

tgtgccagat 

tgggagaaat 

cattctatat 

tcagtaaaga 

cagggtggga 

ctctgtctcc 

atagtcccag 

ctgcagtgag 

tgtttctaaa 

tggtaagaga 

cccatcgtgt 

atagaagtaa 

atataaagct 

tgttgtagtg 

gatttggctc 

tggtctgagc 

tgctgataag 

ggtgtacaaa 

actgaaaaac 

aaaatgctaa 

ctaagtcata 

ttaatgactg 

ttctgcaatt 

gcaggagggc 

gtttttatct 

tggctgtgac 

gggcaggtgt 

tggtcaaaga 

agcacctgtt 

attgtgcatg 

tgcttgagcc 

aaagaaagaa 

ttctctcttt 

gagcacaggc 

cagaactcat 

aaaggaaaac 

agactttgtc 

tttatcctga 

ttctttataa 

ttcttaacca 

tttttttgtc 

aaaaagtaat 

tgcacactgc 

gatggtgttt 

tcatttctat 

ctgcataccc 

cttgaggctt 

ccagccagat 

tggacagatt 

tactgtgttt 

ccagtagcag 

cacctcctac 

acccacagaa 

aaaagcgttg 

aggaaatggg 

tgccccatga 

tctaggcagg 

tgctgaacac 



ccaattaaca 

atgaaaaatt 

aaccctcgtc 

gcaaacatac 

tatagttaac 

tccaacacat 

cttgtttgtt 

gataaatgct 

atgtaacaaa 

gagggctggg 

ggatagcttg 

acaaaaaata 

ctacttggga 

cctacgactg 

agaaataaat 

aaggactttg 

gataagcaaa 

aaatcatacc 

tctggaataa 

ggcagccttc 

aggcaagacc 

ttccccagca 

accggaatgg 

acgatctcgt 

aaaaccaaca 

agtaggcatt 

gtacataatg 

aagttcttta 

acacaggctg 

taacctcgct 

cacagagttg 

acctgtcacc 

cccctttctc 

ggaagacctt 

ctgtcctgcc 

gtagcatgag 

caggagtttg 

agaaagaaaa 

ggaatgtagt 

taaagcatct 

tatgttgtga 

cacccatcat 

aacatgaata 

cttttataaa 

ttatcatttt 

tttcactgtc 

tattgcatta 

aatattttca 

tcccagagat 

ataaacttct 

ctctttgatc 

tcttgggtga 

gttttataaa 

ttggccactg 

aaattaattg 

gatcctatag 

gaggctccat 

aataccccac 

tttaagggga 

ttcacatggg 

ggagggttgg 

tagagggttc 

gcttcccaat 

agatgttacc 



ccccgacctg 

tatgggccca 

attaataagc 

aattagatag 

aacaatgtat 

agaaataata 

cttggaagct 

tgaggtgata 

atatcacacg 

cacggtggct 

aggccaggag 

aaaataaaaa 

ggctgatgca 

caccactgca 

taaataaaat 

gggctcaaca 

tgccttaacc 

cacttcaagg 

tgcctggcac 

tgaatctgtg 

tggggaaggg 

gagtgaggct 

gaggctggag 

atgactcctt 

tatttaatga 

cacatgttta 

tgtgagccac 

aacatcaaca 

gggttgaaac 

gaatctcaga 

ttgtgaagat 

ccactgatct 

cctcaccact 

tcctgataga 

agaatctccg 

cctgtaatcc 

agaccagcct 

gaataataat 

aagtgtacct 

tgactttatt 

aaataatttt 

tctcccaacc 

cttactttac 

caaatatgat 

gtggctgcat 

tggggaaatg 

tattcttaag 

aggctcttgt 

gtaggaacac 

gctaatttaa 

attaacaggt 

agtagtcatc 

tgtgagcgag 

aggctcctga 

gagaaaatgg 

tgggggcctg 

gtgtccccca 

atacatgtac 

aaggaattgt 

atcacctaat 

agagttgtgc 

tcaaattaag 

gcttccccaa 

catagcacct 



taatgaatga 

ctttggagtg 

ctgggggtgg 

aagtaataag 

tgtatatttc 

aatgcttgtc 

aagcagggtt 

gatatcctaa 

tatcccataa 

cacatctgfca 

ttcaagatca 

cgaattagcc 

ggaggattgc 

ctctccagcc 

aaataaaaat 

gtactagcct 

cctgtgtgcc 

gtcattataa 

acagtaggag 

tcctctttgt 

cagagactga 

gggaaaggtc 

cataaggcag 

tatactgtta 

atgattccaa 

aaaattgagt 

agctatcccc 

tacaatgcca 

ccagcttttt 

tgtctagtct 

tcaataaaat 

ccagagttga 

ccgcatgcat 

ggaggaccat 

aaggagctct 

cagctactca 

gcgcaacata 

agtaataaat 

aataaatgtg 

ctataagcaa 

ccaacattaa 

ttcaataatt 

atggtcgcaa 

gttataaacc 

aatattgcat 

gaggataatg 

atcaaatccc 

tatattttac 

agacgtcatc 

taagtatgaa 

tgaactattt 

cacttccttc 

cacttcagag 

gcaggggaat 

gctgagaggc 

aactggggca 

tattagagct 

tcactctccc 

tctgtcctgt 

gaagggatgc 

aaaatgcaac 

agagtctaca 

ccccacctcc 

tgcaccatga 



gagtgccttt 
catggtggag 

ggggggagaa 

ttctaatgtt 
aaaatagcta 
tgcggccatg 
gaacctggtt 
ataccctgtc 
atatgtacaa 
atcccagcaa 
gcctgggcaa 
aggcgtggtg 
ttgagcccag 
taggcaacag 
aaaaagactg 
tgaaccctgg 
tcactttctt 
aaagccaata 
tttaataact 
ccactaatgg 
gggcaactgg 
tgggagacag 
ttcagttttt 
atgttttcat 

ggggattctt 

tgatttaaat 

aaaatcatga 

attccagaat 

tgctaactgt 

gtaaattgaa 

cacaacatgt 

ttcggctgat 

tcctcccgaa 

tcttcagtca 

cagtaaaatc 

ggaggccgag 

gtgagaccct 

cacctgtgca 

atcattgtaa 

taaaagagga 

caaagaacat 

ttcaattttg 

tcagtgttca 

ggtctccatg 

tgactatgtt 

ccagggtcat 

agcagtgaga 

tagattgttt 

caaccttgcc 

atgctatcct 

tccaagtatt 

acctgtttat 

tcaatagaca 

gcatgatcaa 

agagatatgt 

acggcctgag 

tgcggcactt 

ttgcaaatct 

tcacttactg 

catccccaac 

agggaatcat 

gcaactaatc 

atcctagact 

ttgtttgatt 



112320 

112380 

112440 

112500 

112560 

112620 

112680 

112740 

112800 

112860 

112920 

112980 

113040 

113100 

113160 

113220 

113280 

113340 

113400 

113460 

113520 

113580 

113640 

113700 

113760 

113820 

113880 

113940 

114000 

114060 

114120 

114180 

114240 

114300 

114360 

114420 

114480 

114540 

114600 

114660 

114720 

114780 

114840 

114900 

114960 

115020 

115080 

115140 

115200 

115260 

115320 

115380 

115440 

115500 

115560 

115620 

115680 

115740 

115800 

115860 

115920 

115980 

116040 

116100 
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agcacctccc 
ctgcagggct 
tcagcaagtt 
aaacaatggg 
ggttaagact 
agcctggctt 
ccagctgccc 
acagcctggg 
cctagcctcc 
ctaggttcaa 
gccaggcccg 
cacttgagga 
aaaatacaaa 
tgaggcagga 
attgcactcc 
gcagcttccc 
atgcacatga 
ccccttggcc 
agcccacaaa 
ctaagtgtgg 
cttcttgccc 
acagggctca 
ctgtttctcc 
atcagccagc 
tataaataat 
aggaggatgg 
aaaaatacaa 
gctgaggcag 
ccattgcatt 
catacataca 
accctaaaaa 
aatttaggtt 
ccaggcagaa 
gtccctccac 
gcctggtgac 
aatctgattt 
tgggatctag 
ctggatgata 
ccttcaatga 
tcggaggcag 
cacagtgggc 
tagtacacgt 
cactccttat 
tgggaggcca 
tagtgagacc 
agctacttgg 
agccaggatc 
gtctcaaaat 
caatgcctat 
accaaataca 
aaacaactct 
ttatctatgt 
agtcaccagg 
tttattgatt 
tcgacagggc 
ttgctgggat 
actggggttt 
gagccagggc 
acactaaccg 
ggaggttctg 
cattgagagt 
gtgccaataa 
atggggtggg 
tctccagaga 



acagtagact 
gtgccaggtg 
cagagtcaag 
actcaacttt 
ggtgtggagc 
tgcagctttt 
ctaacacctt 
aggcaccacc 
caactgcaag 
atttagaatg 
gtggctcacg 
cagaagttcg 
attagccagg 
gaattgcttg 
agcctgggca 
tccacttccc 
atatccaacc 
accctttggc 
gggccgggaa 
tctagaagag 
ccatggaggg 
gggaaggact 
atctgtaaga 
acacgtaagt 
aggccaggcg 
atcacaaggt 
aaattggccg 
gagaattgct 
ccagcctggg 
tacatacata 
tgagacaggg 
ttcttgcctg 
caaggtgtgg 
ctcccagcac 
cagtgggaga 
tcccccaggg 
ctgagaggct 
cgaagatgta 
tgggacactg 
agaccaggct 
agccctagca 
gggtaggaag 
gagaccacag 
aggcaggcag 
cccacctcta 
gaggctgagg 
atgccactgc 
aaataaataa 
ttctccaggt 
gggtcagcta 
atgagctggg 
atttatctat 
gatggcctca 
gattgattga 
tggtcttgaa 
taccaacacg 
gaaaaggtaa 
acaaacccat 
ccaagtgttc 
tggactcact 
agctgcgctg 
acccctcctt 
cgggggcggg 
tcttggcttc 



gtgtttctga 
cacaaaataa 
tgggataggg 
ctaaccaaga 
ttcattaaag 
aagtcatgta 
gcaggggcag 
ccactagtgc 
catcaatctt 
atccagctcc 
cctgtaatcc 
agaccagcct 
catggtggta 
aacccaggag 
acaagagcga 
aaccacagct 
aacatgtcta 
catgggttca 
gtccacttgg 
agatgcagct 
aaactggcta 
ctttttgacc 
aggggacatt 
tcatgacatg 
tggtggctca 
caggagtttg 
ggtgtggtgg 
tgaacccggg 
tgacagagca 
catacataca 
aaagagaaaa 
ttttagaaag 
aactcactgt 
agtaggcagc 
gctgaggagc 
cccacatcac 
ccatttttgg 
gctttgcagg 
aagcccacag 
agaactcagg 
agtgccaggt 
tcatatctaa 
gtgctgggtg 
attgcttgat 
ccaaaattag 
tgggaggact 
acaccagcct 
atgacagcag 
agtcactagg 
ctctgttatg 
aacaagtatc 
ctattcattt 
aacttgagct 
ttgattgact 
ctcctagtct 
agccaccatg 
gcgacttgcc 
caaagcctgt 
tacacagtga 
cggtgggtat 
ttagtaagtg 
ctacctggtc 
tgtcctcctc 
ccgtggggct 



taggtcagca 

acaaagccaa 

aggctctctt 

gaactccagg 

aagaaaagat 

acctttgatg 

ggctggagtg 

aagccgggca 

gcacttggaa 

ttgaagttct 

cagcactttg 

gaccaacatg 

cacgcctgta 

gcagaggttg 

aactccatct 

ccatctcaga 

aggcccaacc 

tgcactggca 

gctttttgag 

tctgggaggc 

gatgagggcc 

agatctaaga 

aatagactct 

aagcaagggc 

cacttgtaac 

agatcagcct 

cgtgcacctg 

aggcagagtt 

agactgcatc' 

tacatacata 

acatgttctg 

ggcctggaca 

ggaagggttc 

acgtgtctcc 

ccagggtggg 

gtgcccagag 

tagcttctag 

actctctaga 

aggagaggtc 

acttttaatt 

agggtggaac 

gaactgaccc 

cagtggttca 

tccaggagtt 

ccaggcgtgg 

gcttgagcct 

ggatgacaga 

atcatcattt 

ataaaagtaa 

ttctttcatg 

gtccttcttc 

atttatttat 

aggaactagt 

ggttaatttt 

taagcaatcc 

cccagcccct 

caaggtcccc 

gctctcgccc 

aggtgacaaa 

gcccagaggg 

agaactcgga 

tcctttccaa 

cttctcaggg 

gcagatccac 



acatttgctg 
agacaacatg 
cactggaagg 
gagctaaaat 
tcacccagac 
aagttatgtg 
caaagggagg 
acctctgccc 
aggaacctca 
atacagaaat 
ggaggctgag 
gtgaaacccc 
atcccagcta 
aagtgagcca 
caaaaaaaaa 
caacaagggg 
acaccctctc 
gaaaggtagt 
attccagggt 
acattccttg 
aaagcagagc 
gcagcactac 
ccccgctaga 
t.taatatata 
cccagcactt 
ggtgaaacct 
tagtcccagc 
tgcagtgagc 
tcaaaataaa 
catacaatac 
acaaccttgc 
ggagccctgt 
tgggtgacaa 
attgactggc 
gtctgaagga 
ctgggaaagt 
tttgggagtc 
acatggagtc 
tgtcccagtt 
tggaccagga 
tgtgaaggtc 
ccagacctgg 
cacctgcaat 
cgagaccagc 
tggtgtctgc 
gggaggcgga 
gtgagacaga 
ttctttctgc 
aaataatatt 
ctttgtttct 
ctcccatctt 
tttgagacaa 
gtcaccaccc 
gaggcagggg 
gcccgcctca 
cccatcttct 
tccctagcta 
attgagccac 
gaggtgaagg 
aagggggatc 
agtccagact 
agccagctgt 
aaaatccgac 
ctagagccac 



agcacctact 
gaccctgaac 
taactccaag 
tctgacttct 
ttgagttcat 
acctctccac 
cactggtacc 
ccaaggcatc 
cctttgaaat 
acagccagca 
gtgggtggat 
gtttctacta 
cttgggaggc 
agatcgtgcc 
caaaacacac 
cctcatgtcc 
caaacatctg 
tcagagaaga 
ccaggataac 
gtcttaggga 
cctctaaagc 
ctctctgagc 
gttactctac 
cccgttgtac 
tgggaggccg 
catctctacc 
tacttgggag 
caagatctca 
taaatacaca 
atggacaggg 
cctttatact 
tcccctcagg 
gtgcagcccc 
tcaggagcag 
atccctagaa 
ggaggcagca 
acagagacac 
caagatattc 
actcagccat 
ttccttttac 
atccgagggg 
ctctgccact 
cccagcactt 
ctgggaaaca 
ctgtagtccc 
ggttgcggtg 
atgacacact 
ctctagactg 
atcagcattt 
tttaagcctc 
atttatttat 
ggtccttcta 
cccaatttct 
tctcgctaag 
gcctcccaaa 
gaataggaaa 
gagagcttca 
cggacctcgt 
gaagagccag 
ttgggtggca 
catccagtct 
tctccagaca 
gctgagccca 
cagagggcgg 



116160 
116220 
116280 
116340 
116400 
116460 
116520 
116580 
116640 
116700 
116760 
116820 
116880 
116940 
117000 
117060 
117120 
117180 
117240 
117300 
117360 
117420 
117480 
117540 
117600 
117660 
117720 
117780 
117840 
117900 
117960 
118020 
118080 
118140 
118200 
118260 
118320 
118380 
118440 
118500 
118560 
118620 
118680 
118740 
118800 
118860 
118920 
118980 
119040 
119100 
119160 
119220 
119280 
119340 
119400 
119460 
119520 
119580 
119640 
119700 
119760 
119820 
119880 
119940 
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gccagcactg 
ccagtgtcca 
agcccaactc 
agaattactg 
acctcatttt 
cgatgaccgg 
attgtctagg 
ctccccccaa 
gtaagagggg 
atgagaccag 
gctggaaaca 
tgggggtgga 

gcggggatga 

ctttcctcgc 
cttgggggcc 
gacagcacac 
gtgatatttc 
acccagcgac 
acccccaaca 
tcggggtggg 
ctgggtgagg 
acgagagggg 
ccgaaaatgg 
ggaatgaaac 
acgagagaca 
gacagtgaga 
gagagagagt 
cgggggagaa 
agcagagaat 
cactgcattg 
aggggcttct 
cacctgcggc 
gctcccccga 
gagtcaagac 
ctgctgatac 
tccgcccctc 
gctttgggtc 
acataca caa 
cactaagggg 
gcgcaaaacc 
ttcagcggct 
caaagggtaa 
cacgctcata 
gcacactggg 
ggacagaaat 
gggtgaccca 
cggcgttcct 
gcgaggacga 
gagggggtta 
cagcaaacga 
cctcgctgtc 
gactctcgag 
cagccttccc 
ggcccgcagc 
tcccgggccc 
gagtgggagc 
gggctcaggc 
gggagagaag 
ggtaacctgg 
tctcagcagg 
ggtgggacgc 
tgtccccgct 
actcagcaac 
atgtgggtag 



cggccaaggc 
agagtccagc 
atgttcaggc 
ccaaaatagt 
accaagcact 
gtagcctcac 
gtcacctgga 
ccctaaccgg 
tcagagtgct 
ccagccaccc 
cgacaatgga 
gggggtgagt 
gcagaggggc 
agctcgtccc 
cggggtcccc 
ctgagtcagc 
gctccccggg 
agattgtgcc 
ctccctcccc 
tcctggggaa 
gaagacggga 
cgggaaagga 
aaccagcagg 
tggggaagag 
gagacgcagg 
aagacagaag 
gagggaggga 
aaccagagaa 
gaatgggaag 
gaccctggca 
gtgctcgcgg 
tccagcagcc 
gggccccgag 
ctgggtcaac 
cgcctgtgac 
tttaaaaggg 
ctctcctgca 
aaaaacgcac 
catacacaca 
tcgcacagcc 
cacccgtgtg 
cgggcaagca 
tgcactcgat 
cttgaggtct 
gctggaggct 
gaagctcatc 
ctaaactacc 
tcgaacacag 
gaaagccgct 
gggaggagcc 
cccactctcc 
cggacagcgc 
cgccccgtcg 
ccgggcggcg 
tcccggagcc 
gccctccccc 
gcctgctgca 
cagggcgtcc 
ggaaggggaa 
agggcacttg 
agggggcagg 
ccacctgctc 
tcctcctgcc 
ggggctgtct 



ttgaagaccc 
cagaggccga 
atgatgtgtc 
aacgacatta 
ttctcatgcc 
cgtacagatg 
gaactctgat 
agctaggtgg 
ctacagagaa 
acccaccagc 
tgacacgagt 
cagcagataa 
tgcgactgcc 
aggaggagga 
tcccacccct 
ccgccgccca 
agccagcccc 
gcggctcatt 
gccgccgcct 
acgcagggtc 
gccccagaga 
aatttagggg 
gggcacccga 
ggggagagaa 
gagccccgcg 
accgggcagg 
gggaacagag 
atcgaatgag 
aataagacca 
cggataggaa 
gagcggcagc 
ccaaccccgc 
cctatcctac 
cgccctgcag 
caggccatga 
ggtagaggaa 
cccccgcggg 
acactcgcac 
ccgggcacat 
tcacgtttcc 
cacactcacg 
tcctgagtca 
cttgcacgca 
gggagtggaa 
gggacactgg 
ttctcctgga 
gcattccccc 
tcctccgggt 
cccgcctcct 
agcgagtgcg 
ctcggctagc 
agctagcggg 
cgccccgccc 
cagggtagag 
ccgcggggtc 
cgctgccccc 
aggtaagaac 
cctctttcag 
aagctcagcg 
gctgggagcc 
tggacccggc 
cgggggacgc 
cggtgcctca 
ggggaggtag 



agcacaccaa 
gtcctcgatc 
tgattctact 
gctacctacc 
tggggtgcct 
aggaaactga 
ctccagactc 
ggtggggaca 
gaccaaatgc 
cagccaccta 
tccctgccct 
ttatgggaaa 
tggagccagg 
gctccccccc 
ccccgaagag 
cccgcccctc 
actgcgctcc 
ccggggaagg 
ccaggccctc 
ctgtcctgcc 
cttttctttc 
cacagagagg 
gagccgaggt 
agggaggcag 
gggaggagga 
gaaacagacg 
agacagagac 
aacgcgagaa 
acatttatca' 
aggaggagcc 
ccagggggct 
cagcgctgcc 
gccggggcgg 
cctttgtagg 
agggccagag 
gttcacgcga 
agataaggtc 
gcgcgcccat 
atttctttcc 
caccagctca 
tgcccccccc 
cacctgcaca 
caaactcttg 
ggaaaagtgg 
cgcgagggac 
aagttgggag 
aagaagggat 
cgcttaagcg 
agtggtcgag 
ggaaggagtg 
agcctgggca 
gcgcgggcgc 
cgtcccgtcg 
cgccgcggcc 
cccgccgtgc 
tcccccgagc 
gccagcggcg 
ggattgaggg 
ctggggccgc 
cgcgggcgcg 
ccggagcggg 
tgaggactcg 
gcactttctg 
gaggcgcaga 



agcccggcca 
tcaaaatgtc 
gggacaatca 
acccctccac 
ggagacttaa 
ggcacaagga 
aaatttccaa 
gcaaatgtgg 
attgtggcac 
aaagtcttca 
caaaaagctg 
ccgtgacacc 
gattcccgga 
agcttcgggg 
cgcgggcccc 
agcgtctgtc 
ggaggcagct 
acgccaaacc 
cccccaggcg 
tcctggaaat 
tgtttctacc 
agctgggggg 
gcccacgggc 
agacaccgag 
gagagacgaa 
agtagagaca 
cacgaaatat 
gaacgagaga 
agagccgact 
gcggcgcggg 
cagcagcccc 
tggccaccgt 
ctccgcggac 
gaagtgccta 
gggctccagt 
agccaacagt 
tcccctcccg 
ctcgcacccg 
acccatcccc 
gacatgcacg 
cccgcttccc 
agcatccttg 
catatactat 
aatcttggag 
gcggctgggg 
gggggaacag 
ttctctagaa 
ggggggaggg 
aaagggttaa 
ggggtggttg 
cacggacaga 
tgggcgtcga 
gggccgatgg 
cggccacgca 
atccggcggg 
atcgagacaa 
ggagagcgga 
tggggcagtt 
gcccccgccg 
tgcgaggagc 
gagggaggct 
ggccggctgg 
gccacctggg 
gggaaatcca 



agcctccagc 
taactgcaga 
ttgccaccaa 
acaccaacac 
tgcagcctcg 

aggggagtac 

ttcgtccccc 
atggggggag 
ctactgtaaa 
gtgggcacct 
gtagtctagt 
tgtataaggg 
cggggcttcc 
ttccgcctgc 
gggaaccgat 
tccgcatctt 
cggcaaacaa 
ccaccctgct 
caggccctag 

agggggagcc 

tgatccgaaa 
ccgagaaggt 
cgggagcctg 
acacacagag 
gacacagaga 
gaaaaggtcc 
gagtaagagt 
ccgtggaggg 
gtatgccagg 
cagcggggcg 
ggcaccgccg 
acccgaagcg 
gcgccgggcc 
ggtgatgggt 
gagaccataa 
cttctcccca 
gacacatcat 
cttgtaaatg 
aagatcgcaa 
ctggcggact 
caagcccgta 
cgcgcacgtg 
tcttatagtc 
ctgtcccagg 
gcgggggagg 
gacaagtcca 
gagtggcgcc 
gggcggggtg 
gtcggcaagc 
ggaagagctt 
cggactgacg 
cggccagccc 
ctcctcccga 
gcccggggac 
ctcagggagc 
gatgctgccc 
gggcatcctg 
ggggaggtgg 
ccagggctgt 
tcgtgaccga 
caggttccgc 
ggaagcgccg 
aagacaggag 
agtggccctc 
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tctggtagga 
aagggtgtgg 
aggagaactc 
gggggagggg 
ggtgaagacc 
agactatgga 
gggaccctgg 
gggatggtga 
atggatggga 
agggactggg 
tccacgtggg 
gtggctggtg 
gacagagggt 
gctcctgtgc 
accacggtgg 
cagctctaaa 
accaccacct 
tgcccgcagg 
gtctgggacc 
atcctttcct 
ctgcccccta 
cggcccaaag 
ttgatagcca 
cttcagcctt 
ttctggccat 
tctagaacaa 
ttccttgcct 
gcacctgggt 
cgtggtttct 
tgggcttgat 
ggccactgtg 
ttgacctcca 
tgcagtgcct 
ctgcagctct 
aggctgctac 
ctgagaaaga 
aactcacact 
taggtggcca 
ctccaccggg 
ggaggcctgg 
gggaagggac 
ggggagagag 
ccctggagtg 
cctcccctcc 
gttacagctg 
gcagcagctg 
ccactatcat 
aggagagtgg 
ctagggagag 
cggaaggaac 
ctgtctctcc 
gggctagcgg 
ggatcagaga 
ggtgactctg 
tccctgccaa 
tcatcacctc 
gaaatctcca 
tgtgcttcca 
accccacgtg 
aggggtgaga 
tgtcactgag 
acgcggagga 
gacactcacg 
ccgcacctgt 



gagatggagg 
gccagagact 
aaagatggag 
gagaaggaat 
aaggaagagg 
cagggaccca 
ggactggaat 
ctagagaagg 
gaggggctgg 
gattgggggt 
aagccctcca 
ggccagatgg 
gaggactttg 
tgccatttat 
ctgcttattt 
cctccgtcac 
ccatcccact 
ccccagccct 
ctgtgcccct 
gggaccactc 
ctttgggtcc 
agtggcctgt 
gagtcctgga 
ccgggatcaa 
atattctgga 
ccgcaactgg 
tgaccctgaa 
gcaatttctc 
gcggaggatg 
gtccctcttt 
tcatgtttct 
ggggattatg 
aggacaaaat 
agccatcccc 
aaagggcttt 
caatgaaggc 
tcccctcccc 
ccgaggtatc 
aattcacaaa 
gcctggaggg 
gtggcaagaa 
ggaaaggcac 
ctcgctccag 
tccttgcctc 
ctttgcacgg 
ttatggagga 
gggctggggg 
ggaaagagaa 
aaagaggagt 
cctgatgcca 
tggggcagcg 
ccatcagggc 
aggatttggc 
acaggaaggg 
cactttgctg 
agtagaactg 
gttctaagcc 
cactccacct 
ggtctaagaa 
agggccgaga 
tattacatca 
cagccagcac 
gtggggccca 
gtgctccccc 



gcgctagaaa 

gggggtgggg 

aaagtggtga 

aaaggttaga 

gaaggagagg 

gaatcctggg 

ggaaaaggag 

ggtatcagga 

agaggaggga 

gctgcccagg 

ttcccaaagc 

ggggtgcttt 

gaaggggagt 

ttatccggcc 

tggggtgtta 

ccgcgtcagt 

cctcagctcc 

tccctgaagc 

ccttccagag 

tgcaccagcg 

agttttcttc 

gggctgagaa 

cagtggtccc 

taccatgttc 

caggggtcat 

aagcctagga 

tagccaaaca 

tcctgtcttt 

tggccacctc 

tccaatctgg 

ggggctccct 

tgtgttgtgt 

aggtgcttct 

tgctgacttt 

cctctgtcca 

gagctgtaga 

agccctcaat 

cccctttcca 

atgaaggcca 

aaggagtggt 

agaaaagggc 

agctctctag 

ccccagcagg 

cagtgagagc 

cagtggcaag 

gcccccagca 

aggtgagccc 

gagaagtagt 

aggcaccctt 

ggggaatggg 

gtcagtccgg 

tcacactctg 

ttcttttggt 

aagtgccctg 

tgtgaccttg 

atgcttgaac 

ccaaacctca 

ctgcctggtt 

actcgggagg 

aacgcggagg 

cacccagcct 

gcaccgacgc 

aaagcgagga 

ctctggagga 



gaggatagtt 
tggggagggg 
gggaagctca 
tgggaaaagc 
aaagaccaga 
atggaggtag 
aatggaaaga 
accggcgaag 
agaggaggag 
gatgagacaa 
gcctgcctgc 
gagctcaggg 
gacagcctcc 
cggacgttgg 
cattctggca 
gcctccccag 
cacctcctca 
agcccgttgg 
cgagaggcct 
actctgccct 
tcctcaagtt 
ctttgtttct 
tcagtgaaca 
tggcctctct 
ctcttcttga 
ggtgaaagtt 
gactcagcat 
acctcaaggg 
ttaaagaaag 
gttccacctt 
tccttcatct 
tcaatgtata 
tggtttcctc 
agaggagggg 
tgaagtagtg 
gacctggtca 
ggtgttacct 
gctctgatac 
ggagtggagc 
ggtgctgagg 
acacactggg 
tcccccaacc 
cctggggcag 
cgctgcgtga 
ggccagaaat 
ccgggtcgct 
tggttggggg 
ctaatgtggg 
gccagctcct 
ccttgcctct 
aggctggggg 
gtgagcttgt 
gtcaagtcct 
gctgggcatc 
ggtaagtcgc 
cagaggaatc 
accctcatga 
cagatgaggg 
agaaagtaat 
tggtgtgaac 
acacacgcac 
agcaccgacg 
gcagcacact 
ggaacaccag 



ctactgattg 
tcagggggag 
aaggaggagg 
gtggagggaa 
tcaggggagg 
cgggaaagag 
tcagaaacca 
agggttggag 
gaagagaaag 
agaggcttct 
cacatttctt 
ccctgggggt 
gagggtgggc 
attctgcagc 
gagtgagaag 
gcccctgcgt 
gcccctgccc 
gtgtggagcc 
ctgctgcctt 
gtgggtgggt 
ccttcttcta 
gagccttggt 
gatacttttg 
tggctccctc 
ctcccacatg 
gcagagagag 
tgtggctggc 
cagtgtctca 
atcagagtgt 
gtactagctg 
gcaaattggg 
aagaagttaa 
ctaccctgct 
gtgagcagag 
gagggatgaa 
ggaggcctgg 
atgatgtgag 
tctgtgcatc 
cgtggtcctc 
aggagtgaga 
cagggcaggg 
ccccagtccc 
tgaagcccag 
attatggatg 
ggcaacagag 
cttcagagag 
agacatggga 
caggtgggga 
gcagagttta 
gagattgcac 
aaagctctgt 
ggataagggg 
tagggaagtg 
aagagacttt 
ttgctctctc 
gaggggacct 
aacccactca 
gtaagagaca 
cgtgaaacgc 
gaatggaaca 
ggggcccggc 
cagcgccagg 
gggagtgtgg 
ggcagctggg 



agtgacagat 

agggatagga 

gagatggagc 

gtgggaccca 

gatgggaaga 

aatcaggact 

gagaaggatg 

acagggaacc 

gctgagagag 

ggtaaccact 

ctctcaggga 

ggctgtgagg 

aaacaggctg 

cgctgccgcc 

ctgtttgcag 

cactggcatc 

cctcagcatc 

cttgcttctc 

tccagggagc 

agcctggatc 

caggggcctc 

actccaaggt 

gctctggaca 

ccctggtcag 

taatcactac 

ctggagtccc 

ccagccctag 

cacattcagg 

ctctctgaca 

catgacctga 

ggccacaata 

cctgtacaaa 

gtactctccc 

agggtggggg 

atgaaggctt 

ggtgctcaga 

gggtcggctc 

ttgccccagt 

gggagagaca 

acagggggtg 

actgagggcg 

accacctctg 

agccccctcc 

agctccttgg 

tcactgttat 

cctgcaggga 

acaagatgga 

gcaggagagt 

ccctcaaggc 

atccttccct 

aatcctccag 

taggattaag 

gagatcagag 

tctggccctt 

tgagctccag 

ttgcggcttt 

gggtccccac 

ttgctcctcc 

cgcacggggg 

gcagccgctg 

gctcacacac 

aggggccggg 

atcttccacc 

atgccagcgc 
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cacactcggg 
gagcacctgg 
gcaacctcag 
gccttggaga 
cccaaggagc 
agcctaggac 
cctggctttg 
aggagtcacc 
tatcagctat 
tatgagtaca 
catcagtatc 
aaaaaaaaaa 
gccaaggtgg 
gaccccatgc 
tgtcacccag 
gttcacacca 
atgcccggct 
aggatggtct 
attacaggcg 
ggcacgggcc 
agtgatccaa 
caagaccctg 
agcctggggt 
aggactggct 
ggctctgact 
tggcaagacc 
ccaagttact 
atgtctggag 
tagaagacag 
ccggcccaac 
tctagacaag 
ccaatttgca 
tcagaaagtg 
catgggagca 
ggtcttgttg 
aaactcctgg 
acatgccacc 
tgttgcccag 
aagtgctggg 
ttggggggct 
tcccctcccc 
ggcggccccc 
ccaggccaac 
cctgcagaac 
cctgtggctc 
agccctggag 
cttccagggc 
gcccggcaac 
cctgctccac 
ctctctgtgg 
acccttcctg 
ctccaggggc 
tctctctctc 
gtctatccct 
tgcctcttct 
cgtgtgtgta 
cccccagggt 
gcccacatca 
aagttgcgct 
gattcccatt 
tgttatgcag 
attgttatta 
aaatcatccc 
aaaatcttcc 



gcctgtcagt 
ctcacgccac 
gagcgtgaca 
ccccacactc 
cccaggatgt 
gttgcttgaa 
ggggtgcagg 
tggaaatcat 
agtgacggca 
gacacctgca 
ataataagga 
gctttggctg 
gaggattgtt 
ctacaatttt 
gctggagtgc 
ttatcgtgtc 
aaattttttt 
cgatctcctg 
tgagccaccg 
tgtggtccca 
gctgcaataa 
tctccaaaag 
gtaaaatgca 
ggtggcctca 
ctgcaagggc 
agaccccaga 
tccagtcatt 
acattttggt 
agatgctgct 
tgtcaatact 
gtcaaacatg 
gtgctttcta 
gaagtggcca 
ccagggctct 
tgttgcccag 
gttcaagcga 
atgcctggct 
gatagtctca 
attacaggtg 
ggcccccagc 
acagctcccg 
agctgcccca 
aacttctcct 
aacctcatcc 
ttctccaaca 
gagctggacc 
ctggagcggc 
atcttccgag 
ctacaggtga 
gcccctctgc 
cctcagcatc 
tttacttttt 
tctccctcta 
ttccatccat 
gtctgtctcc 
tctgtctctt 
cccctgccca 
cggtgaagta 
tctctccaat 
ttccaaacct 
ctggagaatg 
ttattgatct 
actccatttc 
cttggctaca 



cccatgcgtg 
tacccaaaat 
caacacacac 
aaaatcacca 
cagagtgcag 
gcagaaggtg 
ctcccttttt 
aagaaagtgt 
aaggccaggg 
ctccactctc 
tggatcatat 
gatgcggtag 
tgaggccagg 
tttttttttt 
agtggtgcga 
tcagcctccc 
tttgtatttt 
acctcgtgat 
cgcccggcca 
gctcctcagg 
gctgtgatcg 
aaaaaaagaa 
gattcccagg 
cttagagacc 
gaaaagtaca 
gtgagtccat 
ctccaatggg 
tgtcacaact 
aaatgcctta 
gaggcagaga 
caatagtgaa 
caatggcctt 
ggccacttga 
aggcctttat 
gctggagtgc 
tcctcctgcc 
aatttttttt 
aactcctggc 
tgagccacca 
tggcactcct 
cctcggcctg 
tgctctgcac 
ctgtgccgct 
gcacgctgcg 
acctctccac 
tcggtgacaa 
tgcagtcgct 
gcctggtcag 
gcctgccctg 
tccccgaccc 
tccatttctc 
cccttctgcc 
actccacaac 
cactgcctct 
cttcacacac 
tctgtgatct 
aaggcctttg 
gagagagaag 
tcactgggca 
gtcatctcaa 
gatgctctga 
aattattgtt 
cccaggaagc 
gagcctccgg 



cacacctggc 
cacagataca 
aaaaccacca 
acccctcagt 
aaacaagtct 
ttcagtcact 
ccccaggcaa 
agaggtcaag 
atgatgggag 
tagcccccag 
ccaaccttca 
cttatgcctg 
agtttgagac 
tttttttttt 
tctcggctca 
aagtagctgg 
tagtagagac 
ccacccgcct 
aaaattttta 
aagctgaggc 
taccactgca 
agaagttttt 
ctgtcccccc 
cgacccttaa 
ggaaagtaag 
ttcacacggg 
gtgactttgc 
ggaggcaggg 
tatagggctg 
aaccctgacg 
aacaggaatg 
ttggcattat 
ggctataacg 
ttttattttc 
aagggtgtga 
tcagcctccc 
tttttcttgt 
ctcaagcaat 
tattcagccg 
gccctggaag 
cctcctgctg 
ctgctactca 
gtccctgcca 
gccaggcacc 
catctacccg 
ccggcacctg 
gcatttgtac 
cctgcagtac 
cccccaccct 
tggcgtgcgt 
tctgtctatg 
tctctacctg 
cttcacctct 
ctcactaact 
ccactccgca 
cacgtgtttg 
cagctgtttt 
gcagagccac 
atgggacggg 
tgcaggggaa 
aaatggaagg 
tattgttgtt 
aataacacac 
ctggaagggg 



tgagcagcac 
tacacacatt 
ctaagcaagt 
ctctcccagg 
tcctcccctc 
gtgtgcccag 
aactgccaga 
ctagttccgg 
gccctgcacc 
gctctctggg 
aaagttactt 
aaatcccaac 
cagactgagc 
tttgatacag 
ctgcaagctc 
gactacaggc 
ggggtttcac 
cagactccca 
aaaaattagc 
aggaggattg 
ctccagcctg 
aagtaactgc 
aggaattctg 
ggcccctccc 
ggcactgggc 
cctcagatct 
cccccagggg 
tgctgctggc 
cccccacaac 
ttagtctttt 
aagagatgat 
tttttaatat 
ttgtcccctg 
tattttttcc 
tcgttgctca 
aagtagttgg 
aaagacaggg 
cctcctgcct 
ggtctaggcc 
cccacctagt 
atgctcctgg 
tccccgccca 
cccagcactc 
tttgggtcca 
ggcactttcc 
cgctcgctgg 
cgctgccagc 
ctctacctcc 
cagccccttt 
ccctcctctc 
tctcttttct 
tttaggtccc 
ctgcctctgc 
tgcctccccc 
tacaccccca 
ccttcagggc 
tctcacccac 
agccactggc 
agaagcccac 
gaaagaaaag 
aataccagta 
atgctgactg 
cctccaaacc 
gtgaaaatat 



tgcatttggt 
cacgcacacg 
gcaatttgca 
gtctctgaac 
tgccttcaaa 
ggaatgactg 
agaaaatccc 
cctagaactt 
cctattaaaa 
cctgcttttc 

tgggggaaaa 

actttgggag 
aacatagcaa 
agtctcgctg 
cgcctcccgg 
gcccgccacc 
cgtgttagcc 
aagtgctggg 
tgggtgcagt 
cttgagccca 
ggcgatggag 
gaatgaggag 
catagttcct 
ggcacaaaga 
accagtgggc 
ccaaagggtc 
acatttggca 
atctagtggg 
gaggaactat 
gacattaatc 
cattcttcaa 
atgagaagcc 
agcccccaga 
cctgaaacag 
ctacagcctc 
gactacaggc 
atctccctta 
tggcctccca 
ttttaccaag 
aagttctgct 
ccctgcccct 
ccgtgagctg 
agcgactctt 
acctgctcac 
gccacttgca 
agcccgacac 
tcagcagcct 
aggagaacag 
ctggtttcct 
tccccaggcc 
ctcttacatt 
ttgctgttcc 
ctgtctgtct 
atctgtcttc 
tgtctgtctg 
actctgcctt 
cctcaagtct 
atcccacaga 
accccttcta 
ggtaaatctc 
attgttattc 
tttgacacgc 
accctgagag 
ccaaattctg 
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ccctctccct 

ctagttatca 

caatagccat 

tacagtagtg 

gagggaaaga 

ttgcctaaat 

gccacagtgc 

ttttttttac 

catctcctag 

gtgcttggca 

aacgagcctt 

atgtgggtca 

acagtttatg 

ttttcacctc 

gatatagtca 

gtcttctcac 

tagggacatg 

tcatggtctg 

aagcacacag 

tttggacaaa 

ggatagtact 

tggagtgcag 

ttcctgcctc 

tttttgtatt 

gacctcaggt 

ccatgcccag 

atttagcaca 

actactacta 

atgctttcaa 

gggcctgagc 

ctaaccctct 

ttccaccaag 

gctggtgtgg 

acttgaagtc 

aaatacaaaa 

taaggcagga 

actatactcc 

aaaagcccat 

ctcaggccaa 

caggtggaca 

atggtaccag 

gaaaacacag 

aggtgaggaa 

ccaggcctgc 

gaatggggaa 

gtatctacca 

gtggccctgt 

tgttttcttt 

agtggcacga 

tcagcctccc 

tttttagtag 

ggtgatctgc 

gccgacactt 

ttttgttttg 

cgcaatcttg 

tccccagtag 

agagacgggg 

acccgcctcg 

ctgaatggtt 

ccacacctct 

ggacccatct 

aggagatact 

tagctatcca 

ccaaggtcac 



acttgaacct 

atctgctagt 

aacaacctaa 

aacagaacag 

agtaaaatgc 

agtgacagta 

caagcacttt 

ctcctcagac 

ggattttgaa 

tacggtaagt 

atttaacatt 

acttcaagga 

aagtgcactc 

cttgttaaca 

cctccaatca 

catcatgttg 

gaatgaatgt 

gctacaaggg 

tttggactca 

ttacttaacc 

taataataat 

tagtgcgatc 

agcctcctga 

tttagtagag 

gatccactca 

ccaataataa 

aggattaaaa 

ctactactaa 

aatctcctta 

ttgaatcccc 

ctgaacctca 

aatggctgta 

tggcccatgt 

aggagttcga 

attaggcagg 

gaatcacttg 

agcctgggtg 

agttcagtgc 

caacagcaat 

ttggggtctt 

gcatcatggg 

cctctgcctg 

acatgggctg 

cccaaagggg 

gggccaggca 

catgccaggc 

tctcccagag 

cttttttttt 

tctcggctca 

aagtagctgg 

agacggggtt 

ccaccttggc 

gttttctctt 

tttttgtttt 

gctcactgca 

ctggaactac 

tttcaccatg 

gcctcccaaa 

taaaaatagt 

aatttgctct 

gtgaaatggg 

actgagctca 

ttaacaccag 

ccagctagta 



ggaacgtgct 

tggaaaatca 

caaacatact 

accaaacccc 

acggtatatt 

atagcagtag 

ataggtatcc 

ctcagttttc 

agcattaaat 

gctatataaa 

ggtttcagtg 

ttatactaag 

agccacctca 

atggagaagc 

taaatctttt 

agcctcacaa 

tcctgaggcc 

gacagtactc 

gcaagaccta 

tctctcagtc 

cttttttttt 

tcggctcact 

gtaactaaga 

aagggtttca 

cctcggcctc 

tccttattta 

tgtaaggcat 

tactgagatc 

tcaatatata 

gatctactat 

cttactttat 

aataggaaac 

ctgcaatccc 

gaccagcctg 

cggggtggca 

aacctgggag 

acagagcaag 

tgaagaaatc 

caggacctga 

ctgaaatggg 

aaggaagcaa 

tgaactgcct 

gggtgagatc 

agctttggag 

ctgtgccagg 

aatgttccag 

catattccct 

tttgagacga 

ctgcaacctc 

gattacaggc 

tcaccatgtc 

ctcccaaagt 

tcagtcatta 

tgagacagtc 

gcctcacctc 

agacatgtgc 

ttgcccaggc 

gtgctgggat 

ctttatgctc 

atggctttgt 

gataacctgt 

cagcccaatg 

ggaggacacc 

aggaacatga 



tcctctgcct 

ggtcagtgct 

gagcacccac 

ctgccttcac 

ggaaaaatat 

ccgccaccac 

actctgccat 

tcatctgtac 

gcatgaataa 

tgcttgttaa 

aagtggccca 

gtcatgagtg 

tctcatttct 

tgaggctggg 

caaccattgt 

caacctggtg 

acacacccag 

tggagtacaa 

ggttcaaatc 

tccatttcct 

tgagacgacg 

gcaacctctg 

ttacaggcat 

ccatgttggc 

ccaaagtgct 

agaagttttg 

ttagcacata 

aaatactact 

ttagttattt 

tttctgactt 

ctgcaaactg 

gagttagtgt 

agcacttcgg 

gccaatatgg 

ggtgtctgta 

gtggaggttg 

actctgtctc 

atgttattat 

ggtcagcaaa 

aagtgtttgt 

cttcacacct 

ggtagggctg 

cgcagggtgc 

gaaactccac 

tgagttcatt 

gtgccaggga 

actcaagtgt 

agtctcgctc 

tgcctcccag 

atgtgctacc 

ggccaggctg 

gttggattac 

cagtggcctg 

tcactatgtc 

ctggggtcaa 

caccatgtcc 

tggtctcaaa 

tacaggcatg 

aagcagatca 

gcaagttatt 

accttggcga 

tctggtacaa 

aactgaagct 

cctagaattg 



catccagggc 

gatgatgcta 

tacgagctag 

agagatacca 

gtcttatatt 

ttagtgggta 

ttacaagcgt 

aatggggtag 

tttgtaaacjc 

aatactattt 

acttggactc 

agtcccagaa 

acagcccagt 

ggccctgaag 

cggtgtgacc 

atagggacag 

gaagagctgg 

ttgagcaggc 

ctggctccta 

catctctaaa 

tcccactcta 

cctcccaggc 

gtgtcactac 

caggctggtc 

gggattacag 

taaggattaa 

tgggcactat 

acaaattgat 

aggaggaatt 

atttaacttt 

ggaataatga 

atagaaagcc 

gaggccaagg 

tgaaaccctg 

atcccagcca 

cagtgagctg 

aaaaaaggaa 

gaccccatcc 

ggcttgggca 

tctctacgcc 

ggccttttat 

ggctgggaga 

aggtgtgacc 

cagaggacca 

catcaacaga 

ttcaggagag 

agccagatga 

tcttgctcag 

gttcaagcga 

atgcctggct 

gtcttgaact 

aggtgtgagc 

catggttttt 

acccagctgg 

acaattcccc 

agctaatttt 

ctcctgaact 

agccaccgta 

gatctcagtt 

taaccactct 

gcaggggttg 

agtgagtatc 

cagcaaaata 

gcccaggtct 



tagtgcctaa 

atgataataa 

atgctaagaa 

ttcccatgag 

attcttattg 

cacagggtca 

gtgacatttt 

caagagcacc 

acttagaata 

taaaaaaaga 

catcctgaag 

attgcacctc 

tgggagatta 

accctataga 

ggaggcttat 

ttaggggcac 

cgcttgaacc 

tcatttttga 

tatatatgac 

atggcaatca 

tcgcccaggc 

tcaagtgatt 

acccagctat 

ttgaactcct 

gtgtgagcca 

aatgtaaggc 

aataataatt 

catgcattta 

tggagtcaga 

aagcaggttg 

aaataatacc 

catagttcag 

tgggtggatc 

tctctactaa 

ctagggaggc 

agatcgtgct 

aaaaaaaaaa 

tccattgact 

gaggggacct 

cctggcatga 

agaggagatg 

tgccacaggc 

caagatggag 

cagcttttca 

tatttactga 

aacagaaaca 

taaagacact 

gctggagtgc 

ttctcctgcc 

aatttttgta 

cctgaccaca 

caccgcaccc 

gtttgttttg 

agtgcagtgg 

catcttagcc 

tctattttat 

taagcaatcc 

cacagctggc 

tgaattccag 

gagcctcgat 

tgaggattaa 

caatgaatgg 

aaagcacagt 

gtctgactcc 
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agagtgcagt 
ttggacccta 
agataataat 
gtagcacaga 
gccgatgatg 
cacagagtag 
aattcataaa 
tcacagatga 
aagcagctca 
aagaacagga 
cagagagctg 
ggatgacttt 
aggggcgggg 
ttttaacagt 
ggatttggca 
gaagggaggg 
agaaattctg 
ctctggaact 
tgctcactct 
tgtcctactc 
agggattcca 
tgagtcaggc 
gggtggtggc 
ggacggatct 
agggcggaag 
tgggtgtagg 
gtggcaggca 
gggaacaggc 
agacctcctg 
cagtgaggtc 
ccccagtgct 
aagtcagctg 
ccactggccg 
caaggtcctt 
gacagcgcct 
cctctggggg 
ttcccccagg 
tattgtcata 
cttaacagca 
gaatattgta 
acagctaggg 
cttcataatt 
tctttatgtg 
ccccatttta 
taatggctaa 
ctcatggaac 
ttcatttaat 
tgcccacggt 
ccagtctcca 
gccacactgg 
ctcctgaatt 
aatacacccg 
aggttagagg 
cttctgccac 
tgcctcaggc 
gcggacctgg 
gagcacgtgt 
cagggcgtgc 
aacaacagcc 
ctgcggctca 
ttccagcgcg 
ggccgagacc 
acgcggccgg 
gaggccgggg 



tgttcagagg 
ggcgagtcac 
agtatccacc 
gcttggcaca 
acgatgatac 
acactcggta 
aatcagatgc 
aaagacccag 
tttgattagt 
tgctgtcagg 
tgaaccggcc 
agcagaggct 
ggagggctag 
ggagagagga 
acagcttgga 
gcaggacttt 
tgggccaaaa 
ttggtctact 
cccccaaccc 
ttgctgacac 
agcctacccc 
tgaggccagg 
agcagggggc 
caatgtggcc 
ccagatggga 
ctgggagtca 
tggggaggaa 
cccctgggtg 
gaacctgctg 
agggagctga 
ctcctctaaa 
atcatcagaa 
tgggccacac 
tctgcactaa 
ctaaggggag 
tggagctgat 
gagctcctct 
gtagccacca 
tttatcatcc 
catactgagg 
aaaaggcaag 
ggaccaaatc 
aaccatcctc 
cagatgagga 
caaacaaaaa 
tcatagaact 
ctctacatta 
ggcatcagtt 
tacttcacca 
cttccctggg 
agaaactctg 
aaagtttagg 
aggcagaatc 
ggtgcacccc 
ccattctctt 
ccaacctgag 
ttcgcggcct 
accgcgcggc 
tggcctcgct 
acgctaaccc 
cgcgcgtgtc 
tgcgcgcgct 
gcagccgcgc 
cgcccccagc 



tctctggagt 
ttcacttctc 
tcatagggtt 
tggtaagagc 
ccatcctaga 
cagctctgtg 
agatgggatc 
gcccagagag 
gtcagagcca 
gaagcaggca 
agtggggagg 
ctcagcccag 
ggctgtgaaa 
aggaccggat 
tggggggagg 
ttgcagagaa 
cctggggctg 
cttctgacct 
ccggcccctt 
ttcataaaaa 
ctggatgggc 
gaggtgcaag 
ggagaatggt 
aagaggaggg 
gggggcgaga 
atgccctccc 
agcgtctccc 
aactggcctg 
accacagtga 
agtcctgctt 
aaagtgagct 
tcccctggtg 
acacccgccg 
tgtggcctga 
tagtaatgca 
gcccctcacc 
tttaaagaaa 
tttattgatg 
aaccctcctt 
aacctgagac 
ctggattttg 
cctgtgtgct 
tggaatcctc 
aacaggcaca 
caaaaacaaa 
ccacaaggaa 
tacagatgag 
aatgacagat 
aaccagaagt 
gaacctgtag 
ccccccgccc 
aaccactgac 
caaatccagg 
cttccctccc 
cttctgtgcc 
ccacctcttc 
gggcagcctg 
cttccgcggc 
gcccggcgag 
ctgggcgtgc 
cagctccgac 
ccgcgaggcc 
ccgcggcaac 
cgatccctcc 



tggaagccac 
tgaggctcca 
gtgacaatta 
tcaatcagtt 
ctgatgagct 
gaatgaatga 
ttaaagatca 
gtgcttggag 
agagctggga 
gggatgctgt 
caagggaaat 
agggagggga 
gtcaagagct 
ttgaaagcta 
aggcaatgga 
acaaaaggag 
tgggtcaaag 
cccgaggtcc 
atcgatcctc 
gaggaacccc 
ctggaagaga 
gagccagctg 
ggtgtcagag 
ctcttggcac 
acaggcagga 
ccaacctgag 
caggcagtga- 
agcagagtgg 
tgccctgcac" 
ccctctctgg 
gggctgatgg 
agctggttat 
ccccccgctg 
ttaggtgact 
atgtggcttc 
ccaataccca 
agggacagga 
gttgactatg 
tagcctgctg 
tccatgaggt 
aactagggct 
gggcacgtgt 
agaacaaacc 
gagagatgac 
aattaaaaaa 
aggtgttcta 
gaaactgagt 
ccaagatttg 
tctgaaactc 
acatggggat 
caccccgctc 
ctcaccaata 
atgctatgaa 
ccggccaagg 
ccactccacc 
ctccacggga 
gaccggctgc 
ctcagccgcc 
gcgctcgccg 
gactgccgcg 
gtgacctgcg 
gacttccagg 
agctcctcca 
accctctacc 



gttccactgc 
tctcgtaatc 
agttactata 
acctgcttga 
ctgtaagcgg 
ggcacatccc 
cctatcctaa 
ctgcgcaagg 
gtttggaggg 
gttaagattc 
gtggtttttg 
gatagggagg 
tattaatgca 
cattcaagga 
ccccaaggca 
aggagaggag 
gcacctgaat 
cccaaaatgt 
tgaccataca 
atttaggtgt 
acaagagcac 
gaggcctgag 
gcagccgaga 
gctcagttcc 
gcacaggaag 
gcctccgacc 
gggagggaga 
atgctcctgt 
aagaggggag 
caagccctta 
gtgccaaggc 
aatgcagagt 
ttaattctga 
ccctagcacc 
cttcctctcc 
gcctagtagc 
cccaattgtt 
cacctgccag 
agggggttat 
taaaacttgc 
ctaagtgctg 
ccagcacttc 
caggaagtag 
tggcttggcc 
aaaaaaagaa 
agcaccttca 
cacagatatc 
aaatcagaaa 
aaactgtggt 
tcccaggctc 
agagatccgc 
ccactttttc 
tcaaaaggtc 
ccccagcggg 
ccacccagga 
accgcctgcg 
tgctgcacgg 
tcaccatcct 
acctgccctc 
cgcggccgct 
ccaccccccc 
cgtgtccgcc 
accacctgta 
gagatctgcc 



atattagctg 
tctgaaatgg 
taggatctgt 
caatgctgac 
gggtgcctgg 
agaactcacc 
gtcccttgtt 
tcacacagcc 
aggcaaggtt 
caaatggatg 
aaatggaaga 
ggagataggg 
tagagaacgg 
agtggcaacg 
gaggctcaga 
gttagaatca 
tccctaggat 
ggattacccc 
tctctgggtg 
tttgagtggc 
caggccatgg 
ccaggatttg 
aggttgaggg 
tgtagcgaag 
gtggaggctg 
aggctcctgg 
gccacagtca 
tctgagaccc 
gacctcaagg 
tctctttgag 
attagctccc 
ccaggaatcc 
accatagttc 
aggcaggtgg 
tcccctgccg 
agtactttgg 
actgagcccc 
atactgtacc 
acataataag 
ctaaaataac 
agcctgtggg 
cctcatatga 
gtatactcat 
aagttaagaa 
taatggctaa 
tacatgctgc 
ctgagtgact 
ggctggctcc 
cctgccaatg 
caccccaaac 
aggggatcct 
cacagcaaat 
aaccctttct 
gtctgcaccc 
tgacttgttc 
gctgctcaca 
gaaccggctg 
ctacctgttc 
gctcgagttc 
ctgggcctgg 
ggagcgccag 
cgcggcaccc 
cggggtggcc 
tgccgaagac 
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tcgcgggggc 

ggtgaggacc 

tcccgaggcc 

gtgccccacc 

tcccgtccac 

cctccctttc 

ccccctgggc 

gccctcccca 

ggcagccgct 

cagagcgtct 

ctttggggga 

tcaacctcct 

gcacctggag 

tgtttgggaa 

acttgggacc 

ttggagaagg 

cgctcccccg 

gccaagagtg 

caaccctgag 

acagccctct 

gtcacataca 

ggcactgctc 

accaaggctg 

agcgtacaca 

aggtagaagc 

agggaccgag 

gcttgaggta 

gaagagctca 

gcacttggag 

gtggtcgctt 

ggccaagaat 

ttttaagaat 

gggaggctga 

ggcagaaacc 

ctatagtccc 

gactctgtct 

gcctagatgc 

aaatgatctt 

tagactccgg 

ccagggagct 

taatgggttg 

gcctgggcag 

ttatcattgt 

aagaggagtg 

acccctccca 

cccgagctct 

ccacccacgg 

gcacacccac 

ccccaaactc 

ggatgttcaa 

tagaaaggaa 

cctcccacat 

ttgtaattaa 

cccagccaca 

cattacctca 

atggaatctc 

aatttgcata 

taaaggtaaa 

ttctagcctt 

catactggcc 

aaaatttctc 

cagagccctt 

gcagcagaca 

accttgggaa 



gccagggcgg 

agcgagggga 

ctgcgctctc 

acctctgact 

ccggggctgc 

ccctcccagc 

tgtcctgact 

gttctggcca 

gacccgcact 

tccaccagca 

tgcctcagtc 

gacactggag 

agaactttcc 

aaaaaattta 

ccgaaagcgg 

ggagccgtcc 

ccatgaggag 

ggtgctggtg 

tggtctggcg 

tccagaccct 

cactcctggt 

cctcagggaa 

ggtttccgcg 

cctgcacaca 

catcagggat 

gaactcgggt 

caatcagaac 

ggctaagaag 

agaagtttgg 

gcacctgtaa 

ttgagactag 

ctagtttgga 

ggtggacaga 

ccgtctctac 

aggtactcag 

aaaaaaaaaa 

tagggcttga 

gagaagaaga 

ttaagggtgt 

cggagtctgg 

gattggaggg 

ggggagggga 

gaaatattca 

tgtgagcctc 

ccaccagccc 

cacacacact 

gtcccacaaa 

tcaatcagga 

tccaaacaac 

ggccatccag 

gggatttgtc 

tctgagtgta 

ctcctccctt 

gtctatccct 

tgcctggaat 

tccctccctc 

tgttatgatg 

acaccttagc 

ttctctcaca 

accagttatc 

agtctctcct 

tcccaaggcc 

gaggctcagg 

gcacagccag 



ggacgcgcct 
gcagatgtgc 
ggccgggctc 
gcggtgctga 
ggctccggcc 
tcctctcctc 
tgtggcagcc 
ttaactcttc 
cctaagggcc 
gagcctttgg 
agggccaggc 
gaatactttt 
tctccactgt 
ttaaaaaatt 
gatgaggtct 
cccgccatca 
gagctgaagc 
gcacctcttc 
cctgaggaac 
gttccttcag 
cacagacagc 
acacagcccc 
tacacttgcc 
gcaaccctgt 
gaaccaggtc 
gatcagaggg 
cttcatcttt 
gacttgggtt 
tagtttgaat 
tcccagctaa 
cctggacaac 
gtggggtgtg 
tcacttgagg 
taaaaattca 
gaggctgagg 
aaaatctagc 
ggaagcaggc 
gaagaaagga 

ggggtgggct 

ggtcttgcag 
acagccagac 
gctgctggta 
tgggccatcg 
aggaagaagc 
agacagaccc 
cgcaccaagc 
caggcaggtg 
cccagcagag 
ccagattcac 
cccaaccccc 
tgagattcct 
cccccattct 
tctctaattt 
tccttcccat 
ataaaagaaa 
ccacccagcc 
tcactcagga 
agagcatcaa 
atctcatccc 
atgtatgtga 
gaagtaggaa 
aagactgcac 
ctctggctcc 
gtttcagtcg 



actgaggacg 

cccggcgctg 

cccagccctc 

gatcgaagag 

ccagtcgccc 

cccggggagc 

ccaagagggc 

cccatcccaa 

cacagcggac 

aagctccccc 

tgaccctgac 

ctcctaagtc 

ggcccctgcg 

ctattatttt 

cagaatgtaa 

aagagcttcc 

cctgcattct 

tcttcatttg 

taagcctggg 

gaaacaagag 

cccaacatgg 

atgccagagc 

tccttggcta 

gggtatgtgg 

agagaagcag 

acaggaatcc 

ctggtgtgtg- 

gggatttagg 

ttggagctaa 

ttgggaggct 

atatcgagac 

atgtctcaac 

tcaggagttc 

aaaaaattag 

cacaagaatc 

ttgggaggtg 

tgaggttctg 

gaagaaatca 

gggggagagt 

gggcttgtgt 

aagagctctg 

aattagaggc 

gtccagatgc 

ctgaagcaaa 

acggacgccc 

agagccatgt 

tcccctaaat 

atcacctcca 

caccttgacc 

accaatgctc 

ctatcccctg 

gcattttcgt 

cctgcatcta 

tctccccacc 

actgcgataa 

ataccgtggt 

gcctgtaatg 

agatccctca 

aaaccttcac 

aacccaccaa 

acctcttccc 

ctctctgacc 

ctccaagcag 

tctagaacag 



actactgggg 

cctgccaggc 

tgctttgcct 

gccagtgtcc 

caccttccct 

aggccgcctc 

gtgtgtggtg 

ggctggggtg 

accagagggg 

agggagcccc 

ccctgcttac 

taccctggac 

tggtgaagat 

atctactgta 

ggattgcagg 

tggtggctgg 

aggtgaggcg 

tccaggggaa 

gaagacctgc 

cagttctcct 

ctttgggtaa 

aaacacctta 

agtgcccttg 

tctctctctc 

gtttccaaac 

caaattggga 

gaagagaggc 

ggtgagtctc 

gaatctagct 

gacgtgggag 

tgagtctctt 

gtctgtaatc 

aagaccagcc 

ccaggcgtga 

actccagcct 

ggaatagaaa 

tgattctggc 

gcatctaagc 

gagagcagct 

caggctggct 

gtggaggagg 

aggctgtcca 

tatttcagaa 

gccactctcc 

atcacgtgca 

agcacgtgca 

tctgacatgc 

gcgatctcac 

cacacaccct 

tgatggggaa 

aaaaaagcaa 

ctgccagaca 

ttcagttacc 

acctccctgc 

cctcctcgct 

gaccagattc 

gcttcctaat 

gagtctggta 

tccagctaga 

ccgactttga 

cctcctcaga 

atatacaggg 

acggctgccc 

agaatgagca 



gggctacggg 

gcccccggac 

cctgctcctg 

gatccccgct 

ggccttgctg 

tccttgcctg 

gctcagccct 

gggcccccca 

cttttgtctg 

acccaggacc 

cctagtcccc 

actttttagg 

caaaagaagt 

agatttgttg 

gccaggaggg 

aggtggtgtg 

cagtgtggca 

gagctgcagc 

tgtctggtta 

gcaaggagga 

atgtgaacaa 

gcaaacagag 

tgcagtgcac 

agctcctgtg 

aggctagaag 

tgcattactg 

tggggactgg 

atcagactga 

tgggcagggt 

gatcacttga 

aaaaatgttt 

ccagcactct 

tggccaacat 

cggcgggtgc 

gggtgacaga 

gatagagggg 

tagggaggtc 

ctgaggcagg 

ggtcagaaac 

gtgaggaggt 

ggctgctggg 

ggtcatagaa 

cagtgaaagc 

accaaccccc 

cacccacact 

agcacaccaa 

acactgacat 

atgcgcagac 

gagataggag 

actgaggcca 

aattcattca 

cccagcctag 

cagtccccca 

tccagctact 

ggtttcctac 

atctgatcaa 

gcctataggg 

ccaactgctt 

acgtttgtat 

gtgcccccct 

tctcagactc 

gttcttcaaa 

ccgactggcc 

tctaaccgcc 
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140100 

140160 

140220 

140280 

140340 

140400 

140460 

140520 

140580 

140640 

140700 

140760 

140820 

140880 

140940 

141000 

141060 

141120 

141180 

141240 

141300 

141360 

141420 

141480 

141540 

141600 

141660 

141720 

141780 

141840 

141900 

141960 

142020 

142080 

142140 

142200 

142260 

142320 

142380 

142440 

142500 

142560 

142620 

142680 

142740 

142800 

142860 

142920 

142980 
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tggggagagg 
ttacacccag 
attttcgatc 
aggacactgt 
accaggtggc 
ccagatagac 
gcccttctcc 
ctctggagaa 
taagctgtgt 
tgtttttgtt 
tcagtgcaat 
aagttctgcc 
tcgaaggtag 
aagatccttt 
tgcaaggctg 
agtgggcagt 



actaggacac 
agtagggggg 
cctaatccga 
ggaagggaaa 
aggcccaagg 
ttgtcagatg 
aaccccattg 
acggttttgg 
gaccttgggc 
ttgtaagctg 
agcccatggc 
tccctcttcc 
gaactttggt 
ggcctttgta 
gtggagtgag 
ggagagagg 



cagatgataa 
ggacggttct 
gaggaactcc 
tttcccaaca 
tccagctggg 
ttctgaaaac 
cccctgcagg 
gacagtagaa 
aagttactta 
gggacaatag 
aaagtgacct 
agcttttctt 
tggttctccc 
gaaacttcag 
agaggaggat 



ggtttataag 
cagccctgtt 
tctccaatga 
gactctgaga 
gtcaggtttc 
tgagcatctc 
agggcaggcc 
gggtaaatga 
acctctctga 
tgccagccta 
acatgcttgc 
ccaaccccaa 
ctttctcttc 
gtgaggaggt 
agatggcgaa 



cccttaagcc 
tccctagctg 
aatacagact 
gtccaggagg 
tatatgaatt 
ctttcacctc 
tgggacagat 
cctagttatg 
acattagagt 
aatcaatttg 
tgttattatt 
agatgtctct 
aggcccaaac 
ggcagagaaa 
gccctagcag 



tctaaggttc 
cgggctccca 
tgggactctc 
ccagggatag 
tttaatgctt 
tgtacatgat 
attcagtggc 
ttcccactag 
tctgtgggtt 
ttgtggggac 
ctctttcctc 
ggctattgct 
tccccacctc 
taagaaagtg 
aggggaggga 



143040 
143100 
143160 
143220 
143280 
143340 
143400 
143460 
143520 
143580 
143640 
143700 
143760 
143820 
143880 
143899 



<210> 16 
<211> 215980 
<212> DNA 
<213> Mus sp, 

<220> 

<221> modified_base 

<222> (1001) . . (1100) 

<223> a, t, c, g, other or unknown 

<220> 

<221> modified_base 

<222> (2123) . . (2222) 

<223> a, t, c, g, other or unknown 

<220> 

<221> modified_base 

<222> (3728) . . (3827) 

<223> a, t, c, g, other or unknown 

<220> 

<221> modified_base 

<222> (5168) . . (5267) 

<223> a, t, c, g, other or unknown 

<220> 

<221> modified_base 

<222> (7481) (7580) 

<223> a, t, c f g, other or unknown 

<220> 

<221> modified_base * 

<222> (8849) . . (8948) 

<223> a, t, c, g, other or unknown 

<220> 

<221> modified_base 

<222> (10375) . . (10474) 

<223> a, t, c, g, other or unknown 

<220> 

<221> modified_base 

<222> (12270) . . (12369) 

<223> a f t, c, g, other or unknown 

<220> 
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<221> modified_base 

<222> (13438) . . (13537) 

<223> a, t, c, q, other or unknown 

<220> 

<221> modified_base 
<222> (15902) 

<223> a, t, c, g, other or unknown 
<220> 

<221> modified_base 

<222> (15939) . . (16038) 

<223> a f t, c, q, other or unknown 

<220> 

<221> modifiedjaase 

<222> (18223) . . (18322) 

<223> a, t f c, q, other or unknown 

<220> 

<221> modified_base 

<222> (20974) . . (21073) 

<223> a, t, c, q, other or unknown 

<220> 

<221> modified_base 

<222> (24403) . . (24502) 

<223> a, t, c, q, other or unknown 

<220> 

<221> modified_base 

<222> (27574) . . (27673) 

<223> a, t, c, q r other or unknown 

<220> 

<221> modified_base 
<222> (30892) 

<223> a, t, c, q, other or unknown 
<220> 

<221> modified_base 

<222> (30901) . . (31000) 

<223> a, t, c, q, other or unknown 

<220> 

<221> modified_base 

<222> (34443) . . (34542) 

<223> a, t, c, q, other or unknown 

<220> 

<221> modified_base 

<222> (38205) . . (38304) 

<223> a, t, a, q, other or unknown 

<220> 

<221> modified_base 
<222> (42373) 

<223> a, t / c, q, other or unknown 
<220> 

<221> modified_base 
<222> (42386) 

<223> a, t, c, q, other or unknown 
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<220> 

<221> modified_base 
<222> (42393) 

<223> a, t, c, g, other or unknown 
<220> 

<221> modified_base 
<222> (42461) 

<223> a, t f c, g, other or unknown 
<220> 

<221> modified_base 
,<222> (44809) (44908) 
<223> a, t, c, g, other or unknown 

<220> 

<221> modified_base 

<222> (51380) . . (51479) 

<223> a, t f c, g, other or unknown 

<220> 

<221> modified_base 

<222> (56740) 

<223> a, t, c, g, other or unknown 
<220> 

<221> modified_base 

<222> (56765) . . (56864) 

<223> a, t, c, g, other or unknown 

<220> 

<221> modified_base 

<222> (62818) . . (62917) 

<223> a, t, c, g, other or unknown 

<220> 

<221> modified_base 
<222> (68518) 

<223> a, t, c, g f other or unknown 
<220> 

<221> modified_base 

<222> (68534) (68633) 

<223> a, t, c, g, other or unknown 

<220> 

<221> modified_base 

<222> (74552) . . (74651) 

<223> a, t, c, g, other or unknown 

<220> 

<221> modified_base 

<222> (81446) . . (81545) 

<223> a, t, c, g, other or unknown 

<220> 

<221> modified_base 

<222> (88519) (88618) 

<223> a, t, c, g, other or unknown 

<220> 

<221> modified_base 
<222> (93791) 

<223> a, t, c, g, other or unknown. 
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<220> 

<221> modified_base 
<222> (93794) 

<223> a, t, c, g, other or unknown 
<220> 

<221> modified_base 
<222> (96565) 

<223> a f t, c, g, other or unknown 
<220> 

<221> modified_base 

<222> (96570) (96573) 

<223> a, t, c, g, other or unknown 

<220> 

<221> modified_base 
<222> (96579) 

<223> a, t, c, g, other or unknown 
<220> 

<221> modified_base 
<222> (96590) . 

<223> a, t, c, g, other or unknown 
<220> 

<221> modified_base 
<222> (96596) 

<223> a f t, c, g, other or unknown 
<220> 

<221> modified_base 
<222> (96602) 

<223> a, t, c, g, other or unknown 
<220> 

<221> modified__base 
<222> (96616) 

<223> a, t, c, g, other or unknown 
<220> 

<221> modified_base 
<222> (96629) 

<223> a, t, c, g, other or unknown 
<220> 

<221> modified_base 
<222> (96633) 

<223> a, t, c, g, other or unknown 
<220> 

<221> modified_base 
<222> (96668) 

<223> a, t, c, g, other or unknown 
<220> 

<221> modified_base 

<222> (96715) , . (96814) 

<223> a, t, c, g, other or unknown 

<220> 

<221> modified_base 

<222> (104447) . . (104546) 

<223> a, t, c, g, other or unknown 
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<220> 

<221> modified_base 
<222> (114521) 

<223> a, t, c, g, other or unknown 
<220> 

<221> modified_base 

<222> (114527) (114626) 

<223> a, t, c, g, other or unknown 

<220> 

<221> modified_base 

<222> (127063) . . (127162) 

<223> a, t, c, g, other or unknown 

<220> 

<221> modified_base 

<222> (139133) . . (139232) 

<223> a, t, c, g, other or unknown 

<220> 

<221> modified_base 
<222> (151051) 

<223> a, t, c, q, other or unknown 
<220> 

<221> modified_base 

<222> (153242) . . (153341) 

<223> a, t, c, q, other or unknown 

<220> 

<221> modified_base 
<222> (164706) 

<223> a, t, c, q, other or unknown 
<220> 

<221> modified_base 
<222> (164708) 

<223> a, t, c, q, other or unknown 
<220> 

<221> modified_base 

<222> (164710) . . (164809) 

<223> a, t, c, q, other or unknown 

<220> 

<221> raodified_base 

<222> (182242) . . (182341) 

<223> a, t r c, q, other or unknown 

<220> 

<221> modified_base 
<222> (192158) 

<223> a, t, c, g, other or unknown 
<220> 

<221> modified_base 
<222> (192192) 

<223> a, t, c, g, other or unknown 
<220> 

<221> modified_base 

<222> (198842) . . (198941) 

<223> a, t, c, g, other or unknown 
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<220> 

<221> modified_base 

<222> (199437) . . (199438) 

<223> a, t, c, q, other or unknown 



<220> 

<221> modified_base 
<222> (208276) 

<223> a, t, c, g, other or unknown 
<220> 

<221> modified__base 
<222> (215974) 

<223> a, t, c, g, other or unknown 
<220> 

<221> modified_base 

<222> (215976) (215977) 

<223> a, t, c, q, other or unknown 



<220> 

<221> modified_base 
<222> (215979) 

<223> a, t, c, q, other or unknown 



<400> 16 

ttgggggtat aaacccagaa gtgggattac tgcaccatac aataatcctc taacttcaag 60 

caatttttcc acaatggttg tatcatttta cattcccact ggctacgaga agggttccca 120 

cttctacaca tcttcaccac catttctgtt tttgtttttg agtaacagct gcctaatgac 180 

tgtgaagtgg tatcttatct cagtgttgat ttgcatttct ctgatcatta atgtgggaag 240 

gcatcgtttc atatgtttat tggctgtttg tgtatcatct tctttggcga tgttgattca 300 

agttatttgc ttgttttttt aattggagtt ttaaaaaatt gttgttgagt tgtgggagtt 360 

cttcattagc tctgcatatt aataccctga tgaaaatgat taacaagtat ttgcttccat 420 

tttgggggct tccattctgg gctgttttta ttcttttgat actcttttga ttctcaacag 480 

tttaatctga ctaaaattca gtttatttct tcttttaatg gccatgctat tgacacatcc 540 

cgtaatcact gccaaatcca gtcatgaaga gtttctttca agagatttat agttttagct 600 

ctttaagttt gtcatgtctg tttcacttaa ttttgtatag tgtacaaaag tctaacttca 660 

ttcttttcta tatggcttgc tactagtata cgaagagcta aatttctctt tccttgagtc 720 

tcaacctctg atgtgtagca atttcttcag aggaaaacat ggtgggaagt tccttaaaca 780 

taggatgctc catggaggtg aaatagttca tcctacaggg aagcttgtta aacacaggaa 840 

gtacatactc agcagctcta gtaagtgagt gaaactgact ggaggcacta ggtccctcct 900 

tccctacgca tatagaagct gtaaggattg ggaagagata ctgtcaggtc agctcagctg 960 

ctgcccggaa gaagctcaga cccactggcc tggctccaag nnnnnnnnnn nnnnnnnnnn 1020 

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 1080 

nnnnnnnnnn nnnnnnnnnn atcactcttt actcaggcca cctacacgct gtttatagcc 1140 

tgcctttgtc tctttggcta tacttcctgt ttatgtctat gcctcccctc tttctttttc 1200 

tttctcttct cttctcatct catctcatct ttcttcaggg gggagcctgg tctagaactc 1260 

acaaagattt gactgtctct gtctccttgc actaattaaa aaatctttta caagcatctt 1320 

ttagcaattc ttacagggaa attttggaat gttaaactct gattgttagc gggctgaaga 1380 

taacaatagc tctgatgata aattgcttgc caggcaagtg tgaaaatctg agtttgatcc 1440 

aaaaagccgg gtacagaggc caaagagtcc ataatcctag taggggcagg aatcagggat 1500 

gggtgggtcc ctggggtttc ctggtttgtc agcgtagccc aattgggaat agccaggttt 1560 

cagtgaacga tgctttctgc aagctgagag aggtccttgt tcaatctctg tgacccaact 1620 

ggagggagaa gagagccagc tctccagaag tggtcctctc aactttgtgc atgcatgtcc 1680 

atgttcacac agggaatgga taatgcttaa aaggaagacc ggcagggggt tggtaatgca 1740 

cctcctttgg tgacatgctt tcctcttgtt catgctgctc caggtgtggt cggcagcacc 1800 

aaaaaccagg tgtatgtttg taatcccagt attctctggt cgtcagtagg aaatgaaaag I860 

cgaggtcatc ttcgtataga gttagcaaac tctaagccag cctcggctac atgagacttt 1920 

gtctcaaaac aaaggaaaaa tcaaggagga cggctcccga gcactgtcac ctgaagctga 1980 

cctctggcct ccacatgcat gtgcgcaaac acatgtcctg cacaaacaca cagacacccg 2040 

catctgctcc ccgacaaaag aacctgaaac cagtatactt tgagaatttc ccattcatag 2100 

ttaccattgt gtgttccttg tgnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 2160 

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 2220 

nnggtggtgc ctttctctta cccagtctag aagggctgga ggcagggtgg atggggcact 2280 
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ttgaactccc acctaggcaa aaaacccagt gatctctggg ccagtgtgtt gtttgcaagg 2340 

gaataaggta gagagccgcg gaggaagaca ttgggggttc tatgagtatg tgaaggggtg 2400 

cacacaccac acacacacat tttttttgtt ttaaatttac aaacattaaa ataggctgta 2460 

atgtggctca gtgggtagaa aaacctgctg tctaagcctg gtacgagttc aatccctgac 2520 

aagctggaag gagacaacca accacaactg ctagcagcca gaagcactgc ttgctaacac 2580 

tcaagagagc ctggagtgga agacactgga tccccagcag gcaagcctgc aagaagatgt 2640 

gccttgccta gacaacggca gaacaaacat caaggctggc agagctgtcc aggactgttc 2700 

atattaatca tgtatagata agagggaatg gcacagacag aacaattcaa cacacggggt 2760 

atgaaaggaa aggaacaagg cacacaaagg acaaagaacc tagcatacaa gaaagcctaa 2820 
gcagagagtg gcacttccca gaagggagtc ataaaataga ctgaattcat taaaacaaga 
gccaaagata aacggctcaa aaaactcacg gaaaacaggt caaaataacg tcacccatct 
gacagttgat actgtcaact taaccgtatc tagaactcca gcaggcacat ctccaggcat 

gcccctgaag gggtctttgg actaggttaa ctgacgtggg agtgacacca tctatggacc 3060 

gaagcctcag acagaataaa aaggagccag tgagctgagc gtcagtgctc attgcttctg 3120 

gcttcctgtc tgtggctgca gcgagacacg gtgcttcctg ctttagctgc catgacagac 3180 

cacaccctca aaccgtgaac caaaataacc tcctctctac attgctttta ccaggcattt 3240 

ggtcacacca atgagaaagg ttaactaata cagcactcaa tacttaaaaa cataaacacc 3300 

aaccttgttt gcatgtgtga gactttgaag ctcacgggcc agttatgccc aatgccaggt 3360 

ctgctggcta agggtgagag tgcacaccta taatcccagc tgctgtggaa tcagcaaaag 3420 

cgctacagat ggaaggcagc cagggcagct gagactgact caaactgata gaggtgggag 3480 

gcatagagaa aaccagatta atagagtgtt ccccactatg caagaagccc tgggtttcag 354 0 

gacgagagaa ctaagaatac agaagtctac tgtgtagaag cactgctagg tcacacagaa 3600 

acatcactca agtgtctctg gatgctacac ggagggcgtg tgaagtattg cttcctgatg 3660 

atctgtatct actacagcac tgctgtttta gtatgcgctc ctccactaca gctcctcacc 3720 

acaccaannn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 3780 

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnaat taatcaaaga 3840 

aaacacacac caccagttag agaaagttaa tcaggccgaa tggcggcttt cccctgtatc 3900 

caggctaccg tcaggacggc tcactgccac tggcaactct gcctgaacaa agcccgcagc 3960 

caacgtgggc ttcaggggct ctaaacactg caatcaaagg ttgtgtgtgg gggtgggggt 4020 

gctgctgcta ttcaaggatt cccaaagctt agatgtattc atcatactca caggaaagcg 4080 

tgttcaaccc atcactcatg agcagtcggt accggggtga cctattccct gtagaaatgg 414 0 

gacggatgtt ctggaaaagt tgacagaaaa gttgattcat taggcaggct ctttgcccaa 4200 

gccctgaggg taagcaaagc taactggcag gagactaggt ttgccattaa tctgagacaa 4260 

gatgaaccac ttgcccatcc tcctgacacc taaatactaa tgaaagaaca atggattgag 4320 

ctggcattat taaaaacgat agaaacagaa gtatcaatag tcatgtgttc tttctcccat 4380 

atgtcaaaac aatgtgtaag atggcatcga acacatgcag aaactgttta gggaacatgc 444 0 

tgaaaatatg aagtaaaatt aaaattggaa agaaagacaa tttgcctaaa gcagctcaga 4500 

gctggagaag ggaccgaggc agagataaca gcaacgtgtg gacatacgga tctggggcag 4560 

agcagtcacg gactcagccg gaaagggtgg ggcagcctct gaaggaagtt aaggtaaata 4620 

gagccacaag gtgattggcc caggagtggt gccaccttca cctcctgcct caaagtctga 4680 

aggaatgatc ctggagtctc ccatctattg atatatgaaa ttcacagtat gttttagaac 4740 

ccactgaatg atgggtagat taactaaaag aaatttaagc ggggtggtgc aggtctttta 4800 

atcccagcac ttgggaggca gaggcaggtg gatctctgtg agttcgaggc cagcctggtt 4860 

ccaggacagc cagagataca tagagaaacc ctgactcgaa aaaacaaaat taaaagctca 4920 

tcaaaacaac aacaacaaca aaaaaaacaa aaaaacaaaa caaaacaaca aaacacccta 4980 

tagtacctgt tggtgagttt gagtgagtga gtgagtgtgt gttagagaga ggggcgggga 5040 

aagtgtgttc tggaaatggg agaaagagaa tgtgcatgtg tgtttctggg atgtagacaa 5100 

aactacatgt cttccatcaa atgcaatgtt taattatcta tgagttgaac catcttcatt 5160 

ctgctaannn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 5220 

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnaac aaaaataaac 5280 

caaaccagta aacaaaatcc tgtaagataa agcctaagac aagacacttc ctggggctgg 5340 

ggagttgctt agaccataag gagttcataa tccaggcgtg agagcccgag ttcaggtccc 5400 

tgggcttcca agtcaggagg agaccaagga atcaacaagt ctcgactttg gtctctagtc 5460 

ccatgcacac acacgcgtgt aaatacgtag atgttcactc acacacagaa gactgcacct 5520 

ggctctctca catctcagcc aacatataaa gcctgcatta tcagaacatt ctaggttcta 5580 

gtttcagtca actcttacac agaatggcca tcatactccg tctacaactt ctcctgatct 5640 

acccacgtgt cattgcttca gtattaacaa aacccagaat aaccagctgc gtagatcctc 5700 

cctgatgccc cagtcattgt cttactgaga ctactaagtc acaaggtagc actctggatc 5760 

caaaaagcaa tatccaattg agagttacaa cctataagga ggagtttacc ttcattatag 5820 

ggcactggat tcccaatctt taatccaacg tcttcagcag atttcataac ttccaagtcc 5880 

atcaaaacaa ctactttcct acaaagacag acacaagtta gaattaagaa ctctgcagcc 5940 

tttcagatga gttactaaga agcttacttt agtagttgtc tggctaaaac tgtatccttt 6000 

accaaccttt tctcattctg gactaacttg agaagtatta attcctaagt aaatacttca 6060 

cttattcttt ccccacatct ccaatgtttt tgtctttaat ttattatagg gcaattcatt 6120 
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tcctatctag ttccctgatt aaaacagtag accttgctgc atgccattat cctcatggag 6180 

gcactgatac aatttagatt attaaataca aaaccctaaa acacaaaaag atgatttttt 6240 

tttaaaacaa gattttaaaa aaagcatgtg ctacgcttcc ttctgccact aagcctacac 6300 

atggtcctct gactgaattt ttcccctcat tctgcttcat ctaatatgtg cttttcaaac 6360 

ctggaattga accagggact tattcatgct aggcaaatgc tctaccatag agctataccc 6420 

ctccaactcc catctcaaat atcatttcca aagacatttt cttggtctct tatttagatc 6480 

aggtttcttt gtcctcctgc agctatgact tcattccttc agaacactcg tcttagcttt 6540 

aagttctgta ttaattagtg attgttttca ttctctctgc tagaatgcac tttcaataaa 6600 

ggcaggtagc cagccacagt gcttaattaa gcaacagccc aacgatgtca ttcactacat 6660 

actgggacaa gatgcctaac atcatctgca gataaagacg aactactggt gtcaggagac 6720 

agctaagggg tccagggctt gggcacgctg agtgtgagca ctggagtccg ggtgcccaga 6780 

aacgcacata aatgcaatat ggatgtggca atctacctct aattccttct ttaagacagt a * An 
ggctctccag agcaagctgg ctagcaagac aagccatatc agtgagctct gggcttgacc 
aagaccctgc ctccaggtgt aactcccaag caaaaggatg atggctcaca aatctcaggc 

tatcatgttc atgtacaaaa tgtcaaccgg catacacaca tgcacacaca tgaaaactgg 7020 

gagaaaataa gaagaattgc aaccaaaaaa tgtaatttga ggacacataa ttgcaggcgg 7080 

ggagtggggg gatgacagaa ggtgaactga gtggaccgag ggaaagctgt gctagcggca 7140 

atgagaagaa gggtggggca gtctgagcaa gggttcagca atcaccacgc tttactgtct 7200 

gcacagcctg gctgtagaat gctgggcttt atcacacaga attattcagt atgtgctatc 7260 

tttacagtaa agttattcta tcaggctatg ctacttcaat agaacaagcc tgaaaaagtg 7320 

gtctgctgct gagaacctga caaagatgac ctgttagaac tgtctgccaa gtgtggaatt 7380 

ccagcactgg ggaccaggag ctcgagggtc accccagatg cagggagtta gaggccagtc 7440 

ttggcaacat aacatcatgc ttcagaaatt aaaaacaaaa nnnnnnnnnn nnnnnnnnnn 7500 

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 7560 

nnnnnnnnnn nnnnnnnnnn catgagatag ttaataaact gaagaaagcc atacaaggag 7620 

taaagtagat agttgcaagc atgaagaaag acaaaccact tgagcttttc ttttgtcgta 7680 

aggaggaaac cagacaggtc cagagagatg gctcagagat* taagagcact gactgctctt 7740 

ccgaaggtcc tgagttcaaa tcccagtaac cacatggtgg ctcacaacca tctgtacagc 7 800 

tacggtgtac tcatatacat taaataaata aataaataaa taaataaata aatcttaaaa 7 8.60 

gaaaaaaaaa aaaaacctaa ccaatcagcc aggcgatggt gacacatgtc tttaatccca 7920 

gcacttggga ggcagagaca ggtggatttc tgagttcgag gccagcctgt tcttcagagt 7980 
gagttccagg acagccaggg tgatacagag aaaccctgtc tcaaaaaaca aacaaacaaa 
caaacaaaca aacaaaaaag gaggaagcca gacaggatgc actttatacg tgaatggaat 



8040 
8100 



tgacaaaaga caagttctat aagtgttagg gaaaggggga ggacaacggg ggttcatgtc 8160 

tgtggtggaa cacgtattag aaggctctgg gtatcctgtt tccgacaaac aggcactccc 8220 

aatcacacag gccactggat gtctcaggca gagaaagatg tgatagattg actttttaac 828 0 

aatcacagac tgtgtggaaa atatttgtaa ggttgtcatt gtcacccagg atagagctga 8340 

tggttattca aacgaggatg ggacaacaga aatgggagag agggatgtga gaaccatttt 8400 

gaaccagggt gatttactgc gcacgtgtat agggtctaca gggagtggga tatgtagagg 8460 

aggcctatgt tcctaacttt ggtaatgagc ttattacagt tactatgcac agcctggaag 8520 

atactggaaa aggtgcaggc taggctagaa aggtactaac tgagggtttg acagcccctt 8580 

ggatgtcagg atgcagcaag cctacctctg tatgtagtca atcccttctc aggctatggg 8640 
tcctgcagat catccgtctc tgtatccatt attcccagtc catcctctga gtggctccct 
cttatccagt ttaacaaaat gctgactgca agctcccaag cccagggctc tggctccttt 
actccttgtt attgtacttt accctgtttg cttgggatag agtgtgccct ttataaacat 

ttgtgaaagg gggaatgaag aagaataann nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 8880 
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 
nnnnnnnnag agagctcaat ggttaggagc actggatgct cttccaaagg tctccagttc 
aattcccagc atcaccatgg cagctcacaa ctgtctgcaa ttccagttcc aggggattca 

acactcagaa acataagtgt aggcaatcta cgtaacataa aaataaataa atgagctgga 9120 

aaagaaaaca tgtttcaaaa tatacaagta atggggctgg aggagatgtc tcaatgggta 9180 

agatcattgg ctgctctttt ggaggttctg ggttcaattc ccaccaccca catgacagct 9240 

cacaactgtc tgtaactttg gtcctgtggg agctgatgcc ctcttctggt gtgcagacat 9300 

acatgtagac aaaacacctg catacataaa ataagttttt aaaaaagtta cacatacacc 9360 

cgtgtgtaat ataacacaca ctggcttaac ttcctcagca ctgactgttc accatacgga 9420 

ttcccatgag gttttggttg cattctatca ccgaaaaaaa aaaaaaaaaa ttagaagaaa 9480 

gtatatacat ataaacctct ccctaaaata aagttttctt ttctaaaagt acatccttat 9540 

ttttttattt tttttttttt ttaagaaatg ggaacaacag ttctgctcac actgtatttc 9600 

tagcatgtaa catcttgcaa gtacttaacc gtattctata tcagctcaac acacttacta 9660 

ccgaagactc aagatcacaa aaaaaaaaaa aggacccaga ctggataatt aaacgtttct 9720 

tttgttgtag taagcgacct cttccttaga agatactaca gtaatgctga agaaatgaca 9780 

catctactgt aatctgttct ctgggattcc aacttgtttc ctctgctact cctcccttgg 9840 

cggcaatgtt cgtctgcatc cggctgagct cctcgctgcc ttgttaaacc tccttcctga 9900 

acttccgacc tgtagttccc gctctacagt gcaagcgagt ggataaggaa gcgcatacct 9960 



8700 
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gccgtctttc agggtgttga cgatgaactt gtggacctgg cagacacagt tgctggccag 10020 

ctgccctccc tcgaccaggg tgttcagctg cgtggccagc atgaacgctg caaaagcaga 10080 

gagagagggg ctcagtctcc aagcctttcc ttaacccgaa agctcatcac aaggagaacc 10140 

attaaataca gctgtttaaa actcctccgc cctgcagaga ggaaagcagc atcaatccgc 10200 

cccatgtaaa agtctgaggc tcttcctaaa tggtatctgt ttctcacagt ctccaaatca 10260 

tttttactgt aattctagtt tctggggaaa gacctttctc ggtctttagc cccgtgacta 10320 

gagacaacag gcaaatattc cagaaaggcc cccattttct ttttaaagct tctannnnnn 10380 

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 10440 

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnngcacat cttgtgaagt gtccacatct 10500 

ttcggtccct cgaatttggg tttcttctgg gacgtggtag catgtgactg tcactccagt 10560 

gcttggagca gcagaggggt caggaactcc aggctggcat tagctgcaga gctggagcag 10620 

gtcctggaga acagaaactt tggttgcagc attaatgaac tagaagaatt tttttgtctt 10680 

ctgttaaata taaatacctc cattatcttc tcataaacag tgttgccttt ttatttaagt 10740 

ttttaaggat caggcacaga gactccatgc cagactacca ctcaaccact gagctacacc 10800 

cccaacttgc ctttctgcta ttttttaaat tgtatcagtg gccaccaaac atggggagag 10860 

gtcagggggc tacgtggagg aattgtttct ctcctaccaa gtgggcccca ggtttcaaat 10920 

tcaggtgacc tggcttggca gcaagcacct ttacccctaa gccatctcat tggcttcatc 10980 

ttttaatggc cccttcccct gctctgaggc aggctctccc tatatagccc tggctggcct 11040 

caggctcgca ggtccaccag tgagcaccag gtttctgctt gtccttacct ccccagcact 11100 

gtggttataa gcatgtgcca ctgtgtcaaa ctcagtcact aagctttgcc aagccatagc 11160 

ccagcccttg agtttactgt ttgtctgtgt ggtgatttgt caaaccactt ttgttccact 11220 

gaggtatttt gtcaagtttg acaaaattag ttgagtatgt aggtcttttt ttctggaatc 11280 

ttctgttata gcttagtctg gtcttgaact catgatcttg cctcaacctc acgattattg 11340 

aggattattg agatggacag gctgtgtgac catgctcggc tgtgtgtttt agcatgcatt 11400 

agtcatttga aaaacgttgg ctcatgacac tttacaggtc ttccatgttt gatatgtttt 11460 

atttaatcca aagtaattcc agcaccagag gctgagacag gaggatctca aggtcaacct 11520 

agagatgcat agcaggcggg gccccactcg gttaggttaa- tatcatcact gacttcagga 11580 

gaaaagtctt aagtattggg gactaaaagc aggaggatct gaagttcaag gtcatcttta 11640 

ggaacttagc agacttgagg ccagcttggg cgctgtggga ccctgttttt aaaccagaaa 11700 

acaaattgaa aggaaaaaaa aaaaaagctg gaggaagtga atgtgagtgt tcacatagtc 11760 

ctgtttccac aagaaaacag ggttactttt ggcaacaaat aggtgctttc tttgaaggct 11820 

ggcatttttg tgacttgtca ttggagaaat gatttaatta agacttttct actgagtgcc 11880 

tctgaagagg ctcttttaaa tttagtttaa ttttatctca ttgttagtgt ggtgtgcttg 1194 0 

tgcacacaga aggcagcttt ctagagtctt ttcactctct cctccacagc tcctggagtc 12000 

aaactcaggc cctggctagg caagctctta ggacagtgtt agctgtagct tattaagttt 12060 

ttaagaattt ttataagact ctgtttttct ttctcaggtc atgatacagc aggaaaatac 12120 

atccataaag cccatcctgc aggtcattgt aagtaccggc atgtgtgttt agcataatga 12180 

agatggttca cttatagtta attaaacatt ggattggatg gaagacatgt agttttggtt 12240 

acttcccaga aacacaaatg cacattcttn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 12300 

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 12360 

nnnnnnnnng aattcagagc tgatatgtag tactaactcc tactcaatga atcctttgtt 12420 

cttctattcc ttcattacat tactgttaat agtggtaact atgtaccaaa gagtcaaata 12480 

actcttggac catccaaggc agaaggaagg ctggcaaaaa tgtatgatga tctgggatgg 12540 

gaatgtactt cagtttgtac aggaggccct tggttcattc catttctggc aatgcataga 12600 

cctgtaggat ctcagcactg gtggggggtg gggggtgagg gtgaaggggc gggaggttaa 12660 

aggcagaata gtcataaatt caaagtctgg gtcctggaaa gaggactaaa cgattaagag 12720 

ctttagctgt tcttctagag aacctggtgt gatccccagc acatggtgcc tcacgactgt 12780 

ccgaaactct gattctaggg ggatctgaaa accctcttct gccctctgta gatacagaac 12840 

acacatggtg cacatacata catgcaaccc aaacaaccca tatacataaa atattttttt 12900 

ttcaaaaaga cattcaaatt cttcctcggc tatatagtgt ttaccaaacc tcaaaaacaa 12960 

aacaaaacaa aacaaaacaa agaatcatta atgttttgcc ttcatgtatg tctgcccacc 13020 

acggacatgc ctggtaccca gggagattaa aagaagacat tagctcccct ggaatggaga 13080 

taggtatgat ctaccacttg ggtgctggga acctgggtcc cctgcaaaag cagtaaatct 13140 

ttttaacccc taagctgtct ctcccaacgc ctaaagattc ttgtaacaca gcatgatgag 13200 

cactggcaag catagcatgg taatctgact tcagggcgcc agattttgag cttaatgctt 13260 

gattattaga agtaacgtac tagatttaat gcctggagct tcaagcaaca aaattaactg 13320 

aagaataaaa ataaaaaccc tgccagccat gatggtaatc ccagaacttg agaggcagag 13380 

gcaggtgatc tctgtgtttt gcaaggccag ccacaatcta catagcacgt tgcagtannn 13440 

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 13500 

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnncga ataaatctac acatgtaaaa 13560 

agaaattcaa agaaacaaat gccaaataaa tacacatatt gtaataaaga gataattgtc 13620 

taaaaaactc aaggctttaa atggtaagat atcatattct tggatgaaaa gatctaatgt 13680 

caaaatatat caatttaatg caattatgta tattcaggag atctctggtt ggcttttgaa 13740 

cttgatagca ctcttataat tcacatagaa gaaaaaaaac catgaaaact gccaaacatt 13800 
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attagaatac tccacagatg gtattttggc agcacataca tcgaagggct gtgaaagatg 138 60 

tgtagatcat ccacgccttg ctagggagag ggcgggtgtg tgtggggggt atagctgttt 13920 

gggaaaataa cctggtaatt cctcattagt taaatcatag tcagaacctg gactagcaac 13980 

ttctctctaa aatacattca ccctcagcat ctgcattgcc aggaaaccac tcctagcagg 14040 

atctgtacgt ggatcaaggt agtagcatct gcatttaatt gacattctcc taaatgcttt 14100 

aaattatctc tagattactt atagtagcca agatgatgca aattatgtta cactgtatta 14160 

tctggggcgt aacaagaaaa tgtctctact caggttcatt caggtgcagt acttcccctg 14220 

aatacttctg aatacacgga tcaagaagcc acagaaagag ggctaaccat atacaagcat 14280 

atagtacact aataaccatg tacaaccata tagtacacta atattcagtg cattactcaa 14340 

aatgcaaaca gatggaaaca atccaacagc ctgtaagctg aaaaacaaga taagcaaaat 14400 

gtgctgggcc tagaggccca ggtctataat tccaactaag gtcgaggcag gaggatctca 14460 

agttcaaggc cagcctagac aacttagcaa gaccttgtct caaaacaaaa agtaaagagg 14520 

ctgaggatat agctcagtat agagcatctg cttagcatgt gcactgacag ccgtatcaca 14580 

gaggaaaaaa aaaaataagc aaaatgtgat ctgtctgcac aacaggatat cacagccccc 14 640 

taccacaggg gaacgacaca gtaacacaac aaaaacttag ccctgaaaat actatggtaa 14700 

ataaagaagt gtcactgagg atcaggaaat gcatgactcc atttacatta tatagaaatg 14760 

agaagatcag tgagcctcta ggactcaaga gatttgggat tggcagctaa agggtactgg 14820 

gtttctttat gggggtaaga aaacattcta aacttaactg tgagaatgac tactcaacaa 14880 

tgtcaagtgt tcaaaaatca tacttttttt tttttttggt ttttcaagac agggtttctc 14940 

tgtgcagtcc tggaactcac tctgtagacc aggctggcct cgaattcaga gattcacctg 15000 

cctctgcctc ccaagtgctg ggattacagg catgcgccac cattgtccgg ctcaaaatca 15060 

tacttttaaa aattgcccag tgactcatga atacaatcag aggcgggaga ggacagtggc 15120 

aaactcagga taccagtgtc ttttatgtct gctgcccaac tatcaatttc ccatagttac 15180 

cagagaactt tttggtttgt ttcatcttat ttgttgcttt tggtagaatc tcaatatagt 15240 

aagatacaag gctggcctca tactatatag ctgaggacga ctttgaactt ctaatcctcc 15300 

tgcttccatc tcccaagtgg tgggattaca ggggtgtacc gctatgccca gcaagcacaa 15360 

agccatttga accacacccc agccttttca gagaaacctg tacaagcctt agtgccttag 15420 

catattaagg caacaaaaga cataatgcgt ggctaccata gagtgtttgc ctaccatgtg 15480 

tgaggctcta ggctaaatgt ccagcactta taaaaaagag ttaaaaacac tcatgactca 15540 

aggatgacta tgcagtcttg tgtacaaagc cccgcattca atccccagca ccgtgcacat 15600 

caggcaggct ctgtagagga cccagcttaa ggtcatcctt aggtaagtta gaggccttag 15660 

atggctacat tagatgagac cctttctcat aaacagaata aataatttaa agctcctgat 15720 

caaacactat gccttcccat cacactcaga ataaagcact ctactggccc tttaaggact 15780 

gcccatctgg aagagaaacc taagttacat tccttgcttg tgtcatatgt gataacaaac 15840 

tcactggaaa tacgaaaata cagtcttaag cttggtcaga aagcttcccc agcaacatga 15900 

tntcagagga cataatgcag aaagtggaca aatgcaaann nnnnnnnnnn nnnnnnnnnn 15960 

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 16020 

nnnnnnnnnn nnnnnnnnaa tcagaggaca tctttcagga gttagttctt tcctccctct 16080 

atagctttca gggatcaaac tcaagtgtgt actgagcgct tatgcccagt gcgccatcgc 16140 

accaggcctg cttctttgtt ttttatgggt ctgaatcaat tagcaccatt acaacaatgt 16200 

tgacaatcag caagtacctt tctctacctg gctagtaaga gaagtaagtg cctttggtgt 16260 

gtgaacgcag tttctcttgt gaagtgcatg gacttgatct ttgctcacaa cgttttttag 16320 

gtccttaagt tgcttgggtt ttatgggaaa ggctcttggg ttttttgaaa agattttact 16380 

acaacttgat ataatcatta tttttaatcc tttaaatagt atgacttatt ttaacagatt 16440 

aatattgaac tgttctttca ttcttacata ataaatcctg ccttaaaaaa taatcctctt 16500 

agcttccttt ctctattttc aaatttgttt tatatttttg catgattttg aacatttata 16560 

aaagtaggca gacaacacag tagaaccaag tccccatata gctgtgcaca tagcttcaga 16620 

ttattgcctg ataataggtc ctgttttgtc tctgttttct cacagggatg tgttattgtg 16680 

tgtgtgtaca catacataca tatatgtatg tatgtatgta tgtatgtatg taatgaactc 16740 

cctttaacaa aacaagtact gggctgggaa gacagcacag ttagttatgt gtttaaccgc 16800 

acaagcatga caaccagagt tgagatcccc accaaccgca taaaaagctg ggcatagtgg 16860 

cattgacctg tagccctggt gctggatgaa agctggggag gcaggtagat cggcagagct 16920 

tactggcaac aaatctgccc agtaggtaag ctctgggctc agacatccta tataggaaaa 16980 

agatgaaggg cgaggcgcag cggcacacac ctttcgtggt agtgcttgag aggcaggggc 17040 

aggccagtct ctgtgaccag cagcctggcc tacatgtcaa gttgcaggac agccagagcc 17100 

accacctacf gagactgtct cagaaataag ttttttaaaa aattgagatg aaggagctgg 17160 

taagatggct tagaaggtaa aggcacttat cactaagcct gaagccccga gtttgaccct 17220 

ggaccccaca ctgtagaacc aactcctcca agttcttctc agacctccag cagagcacaa 17280 

gtgtatgcag acacacacac taagtaagtg aatgtaaaaa acatgacgta gtggcactgg 17340 

cctttaaacc cagcattggg aggaagaagc gggtggatct cttgagtttg agaccagctt 17400 

agcctacata aggaatttca aggcagccag ggctacctag aaagtagctg tttatgaatg 174 60 

aatgaataga aggaaggaag aaagagagac agacttaaaa aatatatgct ggagagtaac 17520 

agaagaggac accggcttgc tggtgtcttg acctctggct tgtacacata cacatgtgta 17580 

gtgcatacac ccacatacaa ttgtactcag acacacacaa acatgtactc attcatatac 17640 
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tgcacacctc aacactcaga aaatgaaaaa acaggtacca tttacacctc cgtgttcggt 17700 

ttccaaccac tcatatgtat gggttgtaaa tgcttatatc tgtatgtgtc tgtatatttg 17760 

tgtatacatt caaagttgag tcaggatcca acgtaaactt ggatagtagt gggttgatgg 17820 

tctggaagcc tgctcgcagc tgtctttttc tcctcgtacc ttttcccctg tttgtttcta 17880 

cgacagcagg tcatttgtct ctaagtgtta gtttcccatc ctctctcttt tgctgatggt 17940 

agccttgtag tagtcacctg tgttctctgt aaaatggctt tgccgtgtta tttcaatatg 18000 

ctatcatcct catcttgcta tatttcattc aatatatgta tatattacaa gatagattaa 18060 

aattatttta attttatgct tatgaatgtt ttgcctaagt atattgcacc ttgtgtgtct 18120 

agtgtccaca gaactcagaa gaaagtgtca catattctgg aactggaatt gcaggtggtt 18180 

gtaagccacc atgtgggacc tggaaaccaa atccaggcgc cannnnnnhn nnnnnnnnnn 18240 

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 18300 

nnnnnnnnnn nnnnnnnnnn nncctttggt caaagatcct cagtttcgac tttgattacc 18360 

cagacttcct gtttctctca tggaacagtt tccccctgag atttactagt ggaagaaagg 18420 

cactcaaaaa gcagggagcc ctcgtacaaa tgagacttcc tagctatata attaggccga 184 80 

gatgcacaca tccacagtca ttacccttct tcagagcctt tgtcatgtca agtgtatttc 18540 

gcccatgtga actttagaac tggcttgttg tgtttcataa aaagtgtagt tgggctcttg 18600 

attgggattg tgttaaattt atagattcat ttagggagag ctgacaactt tacagtatta 18660 

aatgttttca tccgcggaaa aggttgtctt gccacttact tgggctttct tttatgccct 18720 

taagtaaagg tttatagttt tctttatatc agtcttgcac atttcctgtt agatttattt 18780 

ttgcttgtaa ggtttccttg gttggtattg tgtataaaca ttttctccca attacatacc 18840 

ataattgatg gttaggagta taaataaaag gtagaatttt aaaattgctg tatgaatgac 18900 

tgctttgcta taattcaaca tcttttcatt tttatgccaa tctacttgac atgtttaggt 18960 

taaatgatga tatctgtaca gagtaatcct ctgatccagt atttgcacat cttactttct 19020 

aacgtccata gcatagatac acatcttata ctgttgagta catatatatt taaggtattt 19080 

accatagtct tataatatgc agcgtgcttt ggttcaagac agttgccctg tgttcctcaa 19140 

cattaacatt tttttcatca caaatacaca ttgaccttta tcaaattttt aaaactatct 19200 

tgagagaaat gaccattttt cttaatctgt taatgtaaaa tttttataaa aatagttata 19260 

aataatatta gcctacatat ttcttctgtt ctctttttca actcttagaa tcagagtatg 19320 

gtagtctcag actaaaccag gagcttccta tctgtttctc tgttcttaag tcacttatat 19380 

aatgtaagga tgctgtgtat atctgccagc taggccttat atacaaaagg cacccatcac 19440 

aaccttctaa aacagtctta ccacttagag accatgttca aacatatggg cctttgaggt 19500 

aattgccaca ttcaagctat aatattgtta tctaagggaa tatcttcact tctagcagat 19560 

gcctaaaaat atctaaaggt aaacactggt aattgctgtg tttgttgatg ctgctcttcc 19620 

tcctcctcct cctcctcttc ctcctcttct tcctcctcct cttcttcctc ctcctgcttc 19680 

tccttttctt catcctcctt tcttttctta tttttgaggc atgatttcac catgtagccc 19740 

taggtaaccc gtaacttact atgtatgtag accaggctag cctctgtctc ctgagtgctc 19800 

atattaaagg tgtgtatcac catatccagc aacacttgct ttgagatggt tagaggaaaa 19860 

aaaaatatac gtaaataaag atggatgcca attactaaat tgttacttcc agtcaaactt 19920 

tgtacctagt ctaaggccaa aatagggatt ttttttctac tttgcaagtt ggctccatta 19980 

agaggctttt cttctcttgg tctcactaga taggaaggag agagaggagg gaaggagaga 20040 

aagcggttga ggagtgggag gtagtgtgac cgagaatacc cagtaggctc atatatttaa 20100 

atatttggtc cctagttgat agaactgttt agaaagatta ggaagcatgt cttaggggct 20160 

ttgaggtttc aaaatttaat gctagaccca gtctttcaag ggagggggcg gtctgtctct 20220 

ctctgcctgc tgcatgcaga gctctcagct actactctag tgtcaagcct gtgtgcttcc 20280 

tgcctcaatg atcataaatt aactgtaagc aagcctccaa ttaaatgctt tcttttatag 20340 

ttaccgtgat catggtgtct cttcacagaa atagtaacct gtggtgattt taatatgcct 20400 

ggaccaggga gtggcacttt taggaggaat ggccttgtta agaggaagtg tgtctctgtg 20460 

ggggtgggca atgagaccct cgtcctaacc atgtgagaac cactcttctc ctattggcct 20520 

tcagatgaag atgtagaact ctcagatcca cctgcaccat gtctgcctgg aagctgcctt 20580 

tgttcccacc ttgctgcccc aattaaatgt tgtacttata agaattgttt ttggggggct 20640 

ggagagatgg ctcagcagtt aagagtactg actgctcttc cagaggtcct gagttcaatt 20700 

cccagcaacc acatggtggc tcacaaccat ctgtaatggg atctgatgcc accttctggt 20760 

gtgtcagaag acaggacagt atacccacat acattaaata aataaataaa taaataaata 20820 

aattcttttt aaaaaagaat tgctttggtc atggtgtctg ttcacagcag taaaacccta 20880 

acataaccct gactaagaca acaagtgagg aaaggtgttg tgtgacactc tggatctctg 20940 

gaagctcacc tcagcatgaa gcttgtcgaa gcgnnnnnnn nnnnnnnnnn nnnnnnnnnn 21000 

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 21060 

nnnnnnnnnn nnnatctaag tacactgtac tgtcttcaga cacaccagaa gagggtgtca 21120 

gatctcatga cagaggttgt gaactcagac ctttggaaga gcaatcagtg ctcttaactg 21180 

ctgagcatct ctccagccca aaataattct tactagtaac atggaacaat caagttttat 21240 

tatatgatac atattaatca acttataagt acatgattat gcacatttat catatcgtgc 21300 

aaccatcact gctgtcgttt tgttttgttt tgttcttttg aggcccggtt tctgtgttgt 21360 

tctggaactc actctgtaga ccaggctggt cttgaactca atgatctgcc tgcctctgcc 21420 

tcccaagtgc tgaaaacaaa tgtgtgcacc accacctctg gctatcactg ctgtcttttt 21480 
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ttttttttta acagttattt atttcgtgca tgcatgtgtg tataagcatg taacgtatgc 21540 

catggtatgc atgtggaggt cagaggacaa ctttcaggag ttagttcttt cctcccactg 21600 

tgggttctag gaaccaagct caggttgtta gacttgcatg gcaagtgcct ttaccacaga 21660 

gccatcctgc tggccctact ataggtcctt atataaaaag atcatatgcc gggcaaaaac 21720 

caaacaaaaa ataaacctca aaaaacaaaa ggaccatata atattgtggg ggagtggatg 21780 

aagtcctgaa cgaatgtgtt ctgttgacat gtctgtactt cagacccatg ggaattggca 21840 

aagccttcct ctggtcctgt gaggatgctg atagtctgtc taaaaactag agatcacagc 21900 

tttctcctct ggatgactgt aaccccagat tgttcctctt cagagactgt ccaccaagct 21960 

accctgccta cttaagctgt acacaatgaa tgagctgagt ttccaggtta cagcacagta 22020 

gacactgtcc atcagtgaga gcacagccta gcctaacagt acacatgtct gctttcttca 22080 

cgtttccaga accaagcctt gctggataga gcatatttgt ctgtttggct tatttcactt 22140 

gataaaaagt tttcaaggag ggccaggtgt ggtggcacac gcctttagtc ccagcactcg 22200 

ggaggcagag gcaggcaaat ttctgagttc gatgccagcc tggtctacaa agtgagttcc 22260 

aggacagcca gggctataca gagaaaccct gtctcaaaaa accaaaaaaa accaaaacaa 22320 

aacaaacaaa caaacaaaca aaaagccaaa aatccaaccc cccccaaaaa aaaaaccaaa 22380 

ccaaaaacca aaaaacaaca acaacaaaaa gtttttgagg tttaatttat tgcatgtcac 22440 

agaatttcac tgtttaaaaa aatggctgaa taatatttca ctatccattc acgtatttgt 22500 

aggcattcat gtgtgtagtg gtttaaataa aaatagcccc cataggcttc tacagttgaa 22560 

tgcttagtca ttgagtagca gtactagaga gggaattgaa ggtgtggcct tattggagta 22620 

ggagtggcct tgttgcagga attgtgtcac tttgaggtcc cagcaacaag gttgctctga 22680 

tcacatccaa agacattcta ggtctatgtg atctggctgg aattcagaca tgcccttaat 22740 

acacaccttt aatcccaaac aatgaaggta aagttagttt ataaaaagaa gcacccatgt 22800 

ttgaaagtga cgtttaatta agagtgatga attagagaaa gatctgctgt cacagagcag 22860 

agaggaaaga gaggcagcat aagagggagc atggcagagg gagagggagg aggggttttc 22920 

accagggcat ttgtacagag acaggttgca gagctagaac aggtgaagac agaacaagcc 22980 

agagaatgag aaggagccag gagattagga cagattgcca atgttaatag gctaagcaga 23040 

gcattttagt cagaaactga gagaagtcaa attgaatcag ttagcttgga aaggagtttg 23100 

agcagcaaca gctgagttaa actagccaac agaatccaga aagaactaga aaagatgagc 23160 

ttactcagca gcaaatctca gaggctaaaa acatcttaga cctagattag actgcatgga 23220 

ggctagacgc ttccagggct aggcctaggt tagcagacgg agagagtaat aagccttgga 23280 

gacaacagtt aatacagaag actatgtaca gacatggata tgaacctctc agccacttct 2334 0 

ccagcgtcat gcctgtctgc attgttagga gtcatctagg aaaggctaag ggcaggcaag 23400 

caacttttcc agagatggtc cactgttttt tgcatggctt ttgagaggcg agctctgaga 23460 

gggaaggttc caagagactt catcccagga ttgctgctta attacgacat gccttttctt 23520 

gtcactgtta tttagtataa tgactcctga gctttagccc atcctattgg gcatatttcc 23580 

tgcagatcaa cataaagatg aactttcaca aattaatgct gtttagatga ataaatgatt 23640 

ttataaaatt cctgatttga tttaaataat tttaggaaga aagctttagg agatagttta 23700 

gttggtttgc cagaaagatg taataacgtc agaatcaaga atagaatgtg gctgggcagt 23760 

ggtggcagat gcctttaatc ctagcacttc ggaggcagag ataggcggat ttctgagttc 23820 

gaggacagcc tggtctacag agtgagttcc aggacagcca gggctacaca gagaaaccct 23880 

gtcttgaaaa acaaaacaaa aagaaaagta agtaaaggct gcataataaa gaatacaatg 23940 

agctttcaca actacaccaa aaagagacat gcttgggaca aatttgtgat caaggaaaaa 24000 

tattcattct agatcaggtc caaggatgaa gccacaagtg tgtgatatga tgaacaagac 24060 

catggataaa ctgttgtttt gagcttaaag aataaaacac tgctttgaaa ttaactatca 24120 

acattctact gtaactttcc tttttataaa ttttatctat gagataattt tctaaagaac 24180 

ttgtgtctat aaaggtatag aaggacagag agaaagaaat aaggtgtggc atctgggctc 24240 

tgctccatcc acccaaataa atatgtgtgt gtgtgtatgt atgtatgtat gtttatctat 24300 

atgtatgtat atacatacat gtgtaggtag gtatatgtgt atgtatataa gtatgcatga 24360 

acacttggga agttgatgag acaagtgaga ggttgggccc ccnnnnnnnn nnnnnnnnnn 24420 

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 24480 

nnnnnnnnnn nnnnnnnnnn nngaattcac tctgtaaacc atgctggcct tgaactcaga 24540 

gaaccgtgtg cctctgcctc taaagtgctg ggattaaagc atgtaccacc acaacccagc 24600 

tagtttaaat gtttcttatt tttttgttta tgggtctttt acctgtatgt atgtgtgtgc 24660 

accatgtgga tgcatggtgc ccttagagtc cagaagaggg tatcagatcc cctggaactg 24720 

gagtgacaga gggttgtgag ctgggacttg aacctaggac ttctaaaaga gcagcaggtg 24780 

ctcttaatag ctgagcctta tctccaggcc gtcccatgga tttggggggc tttgtttcat 24840 

tttattttgt tttgagacag ggtgtgtagc tcatgcttga atttactatg aagccctgac 24900 

tcccctcaaa gtaaagatcc tcctgcctct gtctacagct gctaggattc gaggtcttgt 24960 

accacatgct cagcacagcc atgattcata acaataaaaa aagaaagaga gacctaaatg 25020 

gccttagaga taaataaatt attttttttt taaagattta tttatttatt tattacatgt 25080 

aatgcacact gtagctgtct tcagaccccc cagaagaggg agtcagatct cattacagat 25140 

ggttgtgagc caccatgtgg ttgctgggat ttgaacttcg gaccttcgga agagcagtcg 25200 

ggtgctctta cccactgagc catctcacca gccccgagat aaataaatta taatgtatgc 25260 

gtaaggtggg atcatctcag tctccgggaa tcttgcctgt tactccttcg ctctcccttc 25320 
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tattcatgct tgggtaactg gccctggctg attgatgaga gctgatttcc ccactgccct 25380 

gtggcaggga ccactgcgcc cacagggctc cctcaggatc ctcagtacag agctgcacag 25440 

ctgggtggaa gtagagggct gcatatataa cacgatctca actttatttc tttaaataaa 25500 

aattttattt aaattttata cagctctata taaacgaagg aactattgaa ggttcagcaa 25560 

ggacctgcca acggttgtca agggtaatgg cgatgtagtg attttttttc ccccttccat 25620 

tttacttcca tactttctac attaccccac aactggcaag tattatttta aaatgaaagt 25680 

aaatagtgac agatgacttt gaaggaaaat tgaatcggta aaaagaaagc tgagagacca 25740 

cccgggaagc ccaggctaaa tgtaatctgg gtcaggcctc ccaggcctgg ggtctcaaga 25800 

tggtcagctg agggaccctg gtgaccctct tgggccagca gggacgggga ggagccggaa 25860 

gctgagtacc caaagtgctc ctctgggctc caagggcctg cacagagact gtgtgggaat 25920 

caaaggatac aggcatgagg actgaggcct gacgaaccca gctatcattc gtcctagaac 25980 

aggaggcaga gctccaagag tccaaccaag aggcaggaag ctttgggacc cgagatgggc 26040 

gatgggatta gaaaggcatg tttgcaaata ctttcaaatt tacgatgcac actcactgga 26100 

aaccccaccc ctgggtgtcc cttccctgcc tcttgccaca cccaatagct gacatcactg 2 6160 

gagaaagtcc caagaccagg ctggctggag ctcctgatag gttccaccct cctgcagagg 26220 

gccctcgaag actagcttgc tcgcccacac cgccagatgt ctgtgtcttt ctctcttttg 26280 

cctcccaccc tcgtctcttc ctccaacctc agtggagggt cccctgcttc ctggggaaag 26340 

tagaacttgc cagtgctcac tgtaatgtcg tccctgtagg tgtcatggtc ccccattact 26400 

gggagcaggt atgcctcaga tctccctcta ttcgctgccc tttcaggctg tctcagtttc 26460 

tctctgacag ttcctctcct cctgaatcct gcttgttggc atgcgaacag gctcaatatc 26520 

ttccatctca aaaaacaaac actgggaagg tgttgagaga cagagagcat gggtaatggg 26580 

tgccccagct tggctgggaa ggggtaactt acaatgctct actgcccagt agggtagctg 26640 

cagttgtcaa ttaattgtaa atttcaaaat agctagtaga gaggatttta gatgttccca 26700 

atcccaacac aaagaaatga taaacattca aggcgatggg tatgctaatt gctctgatct 26760 

gatcaccgca cattgtatac atgtttttga aatgtcaggc tgtaccccat aaatatgtac 26820 

aattaccgtg cagtgattca agataaaaac tataatttta aaaagctaaa aacagaagga 26880 

aatagctgcc cttgaccccc ccacccccac aaggtccttc- ctgtttgtcc agccacttaa 26940 

tgtcagagct tcctgtggga gggtggtttt ggtgtacaca gacactcctt cctccctcct 27000 

tccccataag aggagtcacc cctgtcccac gatgccatgc' agggccacat gcgtgatatt 27060 

aaccagtaag atgtgagcag ggatgatacc tgtctcttat aacaaacgga aaaaaaacca 27120 

caccaaacca aaaacaaaca aacaaacaaa caaacaaaaa cagggttggt ctgtccctgt 27180 

gtcttttccc acataaagtt aagcacacaa agtagccacc atttatttat ttgtcccctc 27240 

ccccacccct ccccgagaca atgtttctct gtataacagc cctagctgtc ttggaactca 27300 

ttttgtagac caggctggcc tggaactcac agagacacag agattcacct gcctctgcct 27360 

cccaaatgca gggattaaaa gcatgagcca cgaactaacc agtaccccag agctcttgac 27420 

tctagctgca tacgtatcaa aagatgacct agttggccat cactggaaag agaggcccat 27480 

tggacacgca aactgtatat gcctcagtac aggggaacgc cagggccaaa aaaatgggaa 2754 0 

tgggtgggta gggaagtggg ggggagggta tggnnnnnnn nnnnnnnnnn nnnnnnnnnn 27600 

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 27660 

nnnnnnnnnn nnnggcagga tcctgtgttc atgtgcaaca ctcgatgcaa gctgtgtagt 27720 

gtttggttct gagcacctga aggggaccaa gcaggctgat gcccaggcca cgggttcttt 27780 

ctggccccac tgcccactcc caccctctgg catccccatg atgaacatgg ccacagatca 27840 

cactactctg ctcctctccc agatccacgg agccataggg tccccagatt catctctgca 27900 

gctaacaagc tgggcagtgt cacctccctc aaggttcctt tcct'gctctg agcagcagtg 27960 

tctcccacag tgagacactc atgtccactg gaagatattg tagccattaa attcctgtgc 28020 

taaaataact agggggactt gtcaatcact acactcttag ccccggactt ctgactcata 28080 

gagggtggtg acagctcagg gacctgcatt ctaccaaata gccatgtgtc cctgatggag 28140 

gaactgcccc tggacaacct ctgcagcaac tgaaccctct gtggtctcct agttcttctg 28200 

gacaggtgtg accccagtac ctagtgccag gtgagagagt gctagggcca cactaagggg 28260 

tgacaggaca aggttggagc tggtagatgt ttgggccacc aaagagaaca ggtcagtagt 28320 

aaaagccatc atggcctgag ccagcctgcg agtctcctct gcagttggga cactcttgca 28380 

gtgtcctggg gacctcttga gggtagcatg gtcaccaaaa tcctacaagg acagatcaga 28440 

agtcagtgag gtcaagggaa cagctctagg ttctctgtgt ccctcacgga cctttttttt 28500 

tttttttttt tttttaagat ttatttattt attatatgta agtacactgt agctgtcttc 28560 

agacagctcc agaagagggc atcagatttc gttacggatg gttgtgagcc accatgtggt 28620 

tgctgggatt tgaactcagg accttcggaa gagcagtcgg tgctcttaac cactgagcca 28680 

tctctccagc cccctctcag tcctgatgcg acagggcagc aaaggccttg tcccagatct 28740 

gaggagagtc atgctgaagt ccttcctacc ccaccccttc cgaacccctg aacatcagcc 28800 

ccataactac tgactccccc acccccattc ccttgcttcc actgatccgg tcctcctctt 28860 

ccctctggcc ccacccattc ttccccagcc ccacctgatt gtacctggtt gtccaacttg 28920 

aagagggcag gcaggggcag cttctgctgg gcctgctcac tcactggctg tagaaatgag 28980 

aaaggagatg aagaaaaggc ccttcccatg ggtccccatc ttgccaagac ataggtgagt 29040 

ccctttggct cttcccccta aacctctcac ttttgagtac ctgctggccc gggagatcca 29100 

cggcgctcac cggagagaac tgttgagaaa agggagaaca gagaactcag cgttcctccc 29160 
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tctccaccct tctggcctct cccagatttg cccccgcccc ccagcatctc cttcagcctg 29220 

actgaccact tcccactcag acctcagctc tgcctcaccg tgaaacaggg accttgcagg 29280 

caggacaagc tgagtacgag gagcccccgg agcagtgcca tgttcctgta tccagaacag 29340 

ggagtgttag ttcctacctc acgctcgaag gccaagcagt agactgctat ccatgggttc 29400 

cttgaccgca ccaggctgcg gaacctggac tcaaaacata gcagctgtgg acctcactca 29460 

ctctgagagg tgggatttcc ataagctttt tttttcacct gtacatttag tcttcattct 29520 

tttcgtctta cactgtggat cagtcctggg ttcaaattta aagccctcat cttgcaagag 29 ^° 

gaccttgcgc atctcccttc atgcctttgg ctttaccctg tcttggtaat tcatggcaga 29640 

agttcttcct gctcccatgt agatgttgag gacccaaata agaatctctg taaatactga 29700 

gcatgatgcc tggcccccac cctagcaaag ccacctgacc tgttgttcat ttcatccagc 29760 

ctttctcagg ctgccctggt cctacccaaa ggctctgaga gctaatctgg gctggcaggg 29820 

cagccagaaa cttctttgtt gaccaatgaa tgactggccc agacacc.ttt ggacttacgg 29880 

gaactacaag cctcatccca cttctgctcc aagttctgat ccagggtgct tcggggaagc 29940 

ccagctggcg gaagggggga ggctctcagc ctagagagcc ttcctttcca tcctcagccc 300 ^ 

cctacccagg ccttatttca ggcaccagct cttctaaaag gtccttctgt tatccctaga 30060 

cctccacaac tgtgttcaag aaccttcagc cagggcctca tctccaatct ggatatatga 30120 

tttttctcgc caagagtagg cctccaggtt ttggagttct agaggtttct cctggagctg 30180 

cctggacctc tgctcctcac caccccagga cgctgtgaag ctgcaggctc cctgaataaa 30240 

ttcatccaga ccccttgcca aggtgccagc tgtctacttc ctctgctgcc caagcagcag 30300 

gctgcaccac ccctccatcc tacctcttca ggcttcttag cgcagcacac gcagcacacg 30360 

gtgttctcct ggaccagctt gctccccacg ctcccccagt gcagccagca gggcctagct 30420 

ctctcctccc acaggacctt tgctctcagc caacccccgt tcagcttgtg ttcagtgctg 30480 

gtaaatattg acctgtacat ccggttaaac attgatatgg gggccagaag accctttccc 30540 

atcaaggcta cccagaaccc tgcctgagcc tggagaaggg gtttacagga gcagataagt 30600 

gaggaggttg ggcctggcaa gccttctaat gatccctcaa cataggggat tatccacagt 30660 

cagtgaggct cagagaggct gtgtggcctg tgtaagggcg cagagtgggc tccagagtca 30720 

cagccaaagt cccaccacca ccaccaccac caccaccacc • accaccacca ctaccaccac 30780 

caccaccacc accaccacca ccaccaccac caccaccact accaccacca ccaccaccac 30840 

caccaccacc accaccacca ccaccaccac cacctcatct acccatacta anttgaggct 30900 

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 30960 

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn tctatgaggg ttacatttta 31020 

gaacatctct ccttttttcc tttttgagac aatcttacta tatttaggct ccccttgaac 31080 

gtgtgatcct tctgcctctg cctcccaagt gctgaggtta caggcatgca cagtcacatc 31140 

tgcctacaca atgtcttagc agcccttagc agcaccaggg gtcaggaagc cctcaactgt 31200 

ccctttagct ggcttctctt gtgaagggct atgtcttctt ccccttccta gcatggagac 31260 

ggctcttagc cccagagcct tccttccttc aggttaaaca gcaccagttt ttggtgggac 31320 

ctcccatttc cctctatctc cctaagcaac gaccttttct gctctgactc tcatctggca 31380 

cttggaccac aagacaaaac tgcagcctgg gctgtgtgtc ctcgcacatc attcctgtgc 31440 

ccccctggag tcaggtctag gggaggaaga cagggttcac gactcagaaa agaccactgg 31500 

ctgtcctagt gtgccctcac ccatcctata gcacgcacat gctgatgtgc cccctccgct 31560 

ccatcaccat cctctcatgt acacgtgccc tccctcgcca gacacatgca tcactaactt 31620 

ttctgacttc ccagaaaaat atctgatctg agaagttagg agtctgccat catcagctat 31680 

ggtccttaaa attaagtcag acaatccatg ggacatgaag ggcaacaacg agaagactcc 31740 

tcgttccttg ttcactctgc ttttggcagc accaccagca ggaaccaacc tggctctccc 31800 

taatccctca tctatagcag gtctcccggt gggaatttta gggacctctg tgttctcatc 31860 

caggggcact gccactcagc tgctcaggga gagacccctt agaacaacaa agaaatcaat 31920 

gcagatttag gcttcttgtt tcccttccca gcccctccca tcacaggcaa cagcctccct 31980 

tggctgagcc tcaggaggct gatttatcag agaggtgctc agagaggcac ctctggtccc 32040 

tctgggtagg tagcaactga gacaggagga gatggtcacc ctgggcatcc tctaccagga 32100 

agtaaatgag atacccttgc agatgggacc cctgaagttc ctcccggggg cgggggtggt 32160 

ggtggtggga gtctaagtca cagatctttg ttaccacgtg gttagactga ggactgaatc 32220 

tgaggtggga aatctgatgt gcatggggaa acacagaggt ccaatgctgg ccaagagcta 32280 

caagcaggga caggtgctag ggggatgtct gaatgttcca ccccaagcca caggaataac 32340 

ggaaatggag actctaaagg gcagaaagtg agggtgtgca gcaggggctg cacaggacac 32400 

atgcaaggcc ctggctgcaa taactgggtt ggggaggcag tcattggcta gccaggggca 32460 

ccaggacagt gatgccatcc tgtccaaagg gcagtgtcca agccagattt ctaggctcca 32520 

gggggaggag ggtccgggga gaggggtcaa gattctcccc ctctgagtca aggttggcct 32580 

tcccatgtgc cccaaatcag gaggcacaga aactgggatg ttgtggtctc acatccaagc 32640 

tgagaagaca agtgggagcc agtacatgtg tttcagatta aacccagtcg gagacaaaca 32700 

tgttgctcct cctcctccca gagccaagct gccttcaagc cacatggcag tgaatatgcg 327 60 

gacagtgcag gggaggacac ctctctctcc actggctcaa ggacagtttc aaggggttca 32820 

ggctggctgg ctcatggcta cgccgctcac cccctggaca gtttggggtt tttccctcct 32880 

gaaatcttgg aatctgaatc agcctgagat accccataat tgtacctccc aacaccccca 32940 

gaaaggtcag ccctgcagaa cagaactctt tggtccccac ccatccccct cagccctgga 33000 



77 



WO 02/29059 



PCT/US01/31488 



ggctgaactg atgggcagct aaggtccaga cagtggctgg ctcttggaaa gcctgtctct 33060 

ttcctttgac tcagaccact ccctgccgtg gcttacatca ggaggtgcaa gggctgcagg 33120 

agggcagcca gaccccacaa accagctagg ctaaatggtg cttattgttc gcaagaggcc 33180 

atgacctcat ttgtctccca gctcttttgg taagagagaa tgagaggaag ctggacagag 33240 

aacctagcag gcctcaggca gcccactgct ccttgctgta agggaaccag caccgatggt 33300 

tctgaaaagc agcgatccga atggagtcag gctgagctgc aggaagctca ccttccttgc 33360 

tcactgctgg tggaagcaac ttcaggaaga gcccagccta tgggactata gctcctccgg 33420 

ggtactgctg agtccagccc cagagcttag ctccctgctt cccaccaccc accaccacat 33480 

cctttcccaa caccattcaa aaccccagtc cagcctctcc tactggtcta cagtgagcgg 33540 

ctaatagagt cctgggcctc tgtcccccca attctctctc ccctctcatc tgttcacctt 33600 

ggttcctaaa ctgcaggggc tactataacc ctacctccac ttccttgcac ccctcttttc 33660 

tgctctctgg ggtgcccctg ccactcccag tccctctagc cagggagcct cttccatatc 33720 

tgtcttcccc aggctagacc aggcgctgcc ttacctgtgg ttgcggcagc ttctctcaca 33780 

gcctgcactc tgaggggctc caggaagcag tgaggggagt agctgcctct caaccagcgt 33840 

ccagcaggct tcagattaca gctactcttt tcttaaagtg acctgactcc atttggaatc 33900 

tgtgattgca tcattgtctg gtgttaactt taacccactg ctgcccttcc gccatgtggc 33960 

tccaagacca cacgttggcc accctcctct cccaccacat ctcccttgga tctttatctc 34020 

tcttcattgg gaccttcatt gggacatgat ggctaacttc aggggcactt gggccagcct 34080 

ggggtaggtc atgagtctga acttgaacat ctgaaaggat tggctgagag gcaggctgca 34140 

tggagagact gtgagccagc cggtatggag atgctgggtt cttccaggcg cttggctctg 34200 

gctcactgca ggtgggagca aggtgattct tctcccctcc tcacctggaa aatgaaggaa 34260 

tgggactgta cctgacagct ctgaaggttc caaaggacag tggggtgggg actagagagt 34320 

tggcccagtg cttatgagca ctggctgctc tcgcagagga cctgagttct gttcccagct 34380 

cacatagcaa ggactcgaaa ctgcttggaa ctccagctcc agagaatctg acgctatctg 34440 

ctnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 34500 

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnatgtcagg catggtggtg 34560 

catgccttta atcctagaac tcaggaggca aagcagatgg- acatctgaat ttgaggcccg 34620 

cattgtctac atagcaagtt ctaggctagc caggtacatc ataagaacct ttctcaaaaa 34680 

ataaataggg ccagttggca aaatttagct tgccctccta acacaagaac ccaaagtcaa 34740 

ccccagcaac catgtaaaaa gaagcaaggt gtggtggcac ttgcttgtga tccagcattg 34800 

tcaaggtgga gacagacgga tccatggggc tcactggcca gccagctagc tggtctactt 34860 

agtatgctcc cagccagtga gagactgaaa aataaataaa taaataaggg gttaggaaga 34920 

ggtaacatgg tggctcagtg agaaaagata cttgtcatac aagcctagca accctcaatt 34980 

ttcagtggcc actaaaggtg gaaggagaga accaactcca aagaattgtc tcctgacagt 35040 

tttatgctgt ggtacacaca cacgcacaca cacacttgtt acatgcatat gcatacaatt 35100 

aataatttaa aatgttatgt gtatgggtgt tttgcctaca tgcatatctt tctgtgcacc 35160 

acatgtgtgc aatgcctgtg aaggctagaa gaggacatca gatcccctcg gagttacaca .35220 

gggttgttag ctaccatgtg gattctggga acaaaaccat gggttttcca aaagaggctc 35280 

ttaatcactg agccatctct ccatcccctc aatagtatat atttctgggg ctggagagat 35340 

ggctcagagg ttaagagtat tgactgctct tccagaggtt ctgagttcaa ttcccagcaa 35400 

ccacatggtg gctcacaacc atctgtaatg gaatctgatg ccctcttctg ctgtgtctga 35460 

agacagctgc agtgtactca tatacataaa gtaaataaat aaaccttttt tttttttgtt 35520 

ttgttttttt tgtttttcga gacagggaga cagggtttct ctgtatagcc ctggctgtcc 35580 

tgaaactcgc tctgtagacc aggctggcct tgaactcaga aatccgcctg cctctgcctc 35640 

ccaagtgctg ggattcaggg tttgcgccac cgccaccacc tccaaggctg ctgctgcggc 35700 

caccaccacc ccaccaccac actacctgac tatttaactt ttaaaggcag ccatctcatg 35760 

gaaaatgaca cctagcattg tcctctggtc cctacatgac cccatgtgca aacacatacc 35820 

tgcataaaca cacataaata cataagtaaa cttagtctgg ttgttttgga aatgtgctat 35880 

ggtttggatt gtgtcccccc aagggaaaaa ttgggtccca gagtagtact attggaagag 35940 

agtagaactg ttagggttta ggcctggtgg gaagtggcca tggctagaga gacatgccaa 36000 

ccaaggggaa tctctggctt catcttttcc ctttgctttc aggtcctaag acagtcacaa 36060 

ggctgcttca ccacatgccc agaagcaagg ggagcagtca tggctgggac cgctaatgca 36120 

gctgttgatt tccccagata tttgtagtag taagagacag gtgaggaacc ccacagcaag 36180 

tgttagtaat tgtgtgtgga ggtgccctcc ggggacgggg gccctcctgg ggcaggacgt 36240 

tcctcttcct catccacctg cactccgaga acaggaaatg gtgactttgg cagagcttaa 36300 

gcagagcccg ttcatgttac aagtatgtaa attcataagg accagtttct ctccatatga 36360 

aacagcttca aacaggagaa ggaagaagca aacattaagg aaaagctctt ttattgcaga 36420 

ggctacactg aagctaccgg ccgccttcct ggaatgtata atcagcttcc ctctgggggt 36480 

tctgtagagc actgagacat taagtactac tggggtccag gattctgcct atgaagagga 36540 

gggcccccgt gtccgtgtcc ctcagaacaa agaggaaagg ttggttaagg tgatagtcta 36600 

gcgggaaggt gaggcggacg ggctggaggc ctgggctggg gctgcttcct gccccctctt 36660 

cattccactc gaaagcagcc ctgtgttcca cttgggtgag cttcacgggt ttgccagtaa 36720 

tcttgctgaa gtcgggtgat tcaaacaacg actgtagctc tgtggagatt cagagattcc 3678 0 

attaacacca cacacacaca cacacacaca cacacacaca cactccctgt ttgtgtaggc 36840 
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tgattttcaa gaaagcaagc tagaagtgga gtacctcaca gtgacttgtg agctatgagg 36900 

cactctgtga caggctcagt gacctacctg agaacttata gccaagatgg ctgaagccag 36960 

acctggcctg agagaatgtt ttgggctgtt ataggacaca tagagataca cacacacaca 37020 

acacacacac acaccaagga ctgagtctaa tgggaggtgg ttcttcattc ccctcccctg 37080 

taatggtgtc acatgttccc tgagccaccc tacaaagaaa gccacaggac tcagttctgt 37140 

cagcaaggtg gcaggctcca agactcagcc ccgagcgcaa agtggccttg caaacatact 37200 

catgtcctgc agagacttgg taagttcgcc ttcgaagctc agcttcagct tggggacagt 37260 

cagcacagct tggatagtct tcagttctcg gtcgatgtca tgaatgaact cagaggtgag 37320 

gctctcttct atcatggtca agttctgggt cacggtcagg ggcaggaaga agatgatgct 37380 

- catacttcct gtcaagggca gctgggcaat ctaacccaac agagatgcgc acaggttagt 37440 

tgtgagccag aaaaaacaaa acaaacaaac aaaaaaacac caacagctgc cttcccctct 37500 

gctgtaacgg ggccccagcc ttgtgctccc cagcctcagc ctgggctgta ggctactggt 37560 

tactggcagt ccttccatga gtagggagtt ttcttctcag cctaaaaccc acagaagttt 37620 

aatgaacaca cgtttgtttg tggttccgct acggtttcta ttgtgataaa acatgactga 37680 

aagcaacttg gagaggaaag ggtttatttc atctgacaat tcgcagggtg tcttctcatc 37740 

actaagggga ctcagggcag gaactgaagc ggaagccgtg gaggaacgct gctttctggc 37800 

ttgctccccg tggcttctta gcctgctttt ttatgctatc cagaaccact tgcccaggag 37860 

tgacactgcc cattgtgggc tgggcccccc cacatcaatc actaatctag aaaatgaccc 37920 

acgggtttgc ccagaggcca gtctggtggg ggcattttat caattgagtt tcacccttcc 37980 

aaatgactct aacttgtgtc aagttgacca cacgaatcag ggcctggttc ttaggagctg 38040 

aagtggaatg tcccccagag actgcctgcc agcactgctg accatttgct ttgtatagag 38100 

cattgaacca gaaatgaaca ataaaatgga tcctttgaac agatgtgttg atcctagggc 38160 

ctgtggacac agcgactggg cttcccagag cccccatgga atcannnnnn nnnnnnnnnn 38220 

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 38280 

nnnnnnnnnn nnnnnnnnnn nnnntggcct catcaggtat cagagagaga gagagagaga 38340 

gagagagaga gagagagaga ggataaaagg ttagcccagt ggtggtggca cataccttta 38400 

attccaacac ttgagaggca gaggcagggg gagctctgtg ggccagtttg gtttacagag 38460 

taagtttcag aatagccagg gctacacaga gaaaccctgt cttgaagaga aacacacaca 38520 

cacacacaca cacacacaca cacacacaaa taagatcttt aagaagaaaa gaaaggatag 38580 

tggggaaaca tctgagcaga ggaagaaatg gggtgcgcag gacacccacc ctcagaggag 38640 

gccctcactg gaggtgtctg cacaggagaa cacttgcact cagcttgccc tagggcgtca 38700 

gaactcagaa ttcagtttca aagcactgac aggagcagtg actggggacc ccaggttgaa 38760 

tccccccttt atctaaaatg agtaagaacc aaaaaaacaa aagtgtttgg gatttggaat 38820 

ctgggttatt tgcatctaca aaagaggtct tggggaggaa acccagttct acccctggaa 38880 

ttcatcagtt tcctataccg ctgactacac aggggctgaa ggtaatctca atgttttcat 38940 

aactggtgtg gtgctacttg ctcatgatcc caacacttgg aaggtaggtc agaagttcaa 39000 

gagcagtctt gactactcag tgaatttgag gctagcctgg gctacatgaa aactcataaa 39060 

acaataaaag aaaagaaaag gtggcgagtg aggtggccca tcaggtaaaa gtgccaacca 39120 

cctcgcctga aagcctccac acggaagggg aaagccagct tctacatgtg gtcctctgcc 39180 

ctccgcatgc accatggctc gtgcaccccc acacccaccc acccacccac ccacatgaca 39240 

taaatacttg taatgattag tttctgaaga acaatatttt cgttgatctt gtttagggaa 39300 

caaagttcgt gcacattgac ctgtcgaccg tgtagtacgg gatccgctcc aggaagctaa 39360 

agattttggg atgcttcatg ttgcagctac tctgatgaac agtgttgctt tctgggccag 39420 

gtaatggtgg catatacctt tgatcccagc acttgggagg cagaagcatg tagatctctg 39480 

tgagttcgag atcagcctgg tctacagagt gagttccagg atatccaagg ctatacagaa 39540 

aaacccctgt ctctaaaaat cactaattta aaaaaaaatt cctttctaaa cctatataac 39600 

aaatgttttg taggctgcct taacaaagcc caatggccat tcagagaagg ctcaaaagag 39660 

aaagtttagg ggactctaca agcatcctca ggaaggccac agaaagcaga gcctgggcca 39720 

gtgagacttt gcagtgggca aggttcagct ctttatgtag gaagaagaga gtcaacagtc 39780 

agagtccagc tttccataaa acctgtgcag ggcctctagg caaagccctg tgttaggggc 39840 

aaaggcattt gcagtctaag cccggtgaca tgagctcaat ccttggaacc caggtggaag 39900 

gagtgtgctg actccacaaa tttgtcctct gatctataca tgtatgcacg tgcacgcaca 39960 

ctcacataca tgtccacatg cacacatgca tatacatgcg catgcgtgca cccacacaca 40020 

gggtcaaaag cagcaagaga tgccctgtga aaaacgtctc attcagtctc ccatcatcca 40080 

gtgccacact ctgagcacag gtggtactga tatcgttcct gattgatcga tcagttgatt 40140 

tgagaccccg cctcactatg tagcccaggc tggcctggaa ctcacagtga tcctcttgct 40200 

tctgcaagat gagcccatca tgcccaccat gttattgaag caataccatg ctctataaag 4 0260 

caaacctagg caggcaggat ggtggactcc tgtaatctca ggacttgaaa agtagaaggg 40320 

agatgaggag ttcacatcaa tctcccgtat gcgttggagg ctggagtggc tgttccctgg 40380 

gcgcttctgc cagcacctga ccaatgcaga tgcagatgct cacagccaac catcagactg 40440 

agctcgggac cccagtgagg gtgctggggg gaggactgga ggagctgaga cgggattgca 40500 

agcccatagg aagaacaatg tcagctggcc aaaccaccca gagctcccag ggactagacc 4 0560 

acgaaccgag gactgcacat gaagggatcc atggctccag atgcatatgc agcagaggac 40620 

agccttgtct gacagcatgg gaggggaggc cattggtcct gtggaggttt gatgccccag 40680 
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tgttggagga tgctggagcg gtggggcagg agtgggtggg taggtgggga gcaccttcat 40740 

agaggcaaaa gggatggggg agaaggcaga tgggatgggg gggttgtgga ggggtaagaa 4 0800 

agaaaaaaga tgtctctgaa agtaaaaagt acttgtcact aagcatgagg atatgagtca 40860 

acccccaagc cccacagggt ggaaggagag aaatgagtcc cacaagttat tttctgatct 40920 

atacgtgcaa tccatggcat acgcagaaac gcaaagacag acaatgagtt gggtgtggtg 40980 

gtgcacatgt aattccatca ttcaggagac agaagcagca gagttgttgg aaatctaagg 41040 

ccaacctaaa gacctacacc caaagaagga caaactataa ggaaaaaggt ggtcgaccaa 41100 

tgtaacatta aagttagaaa tctctcttca cactgtgtag atactgtaca aggaagagaa 41160 

aaggcagcca catcaaaaca gtgtaaatca acgagaaaac cagaaacaac tcaagagaag 41220 

gctgcagggg cctgaattct gttctcagaa cctgcatcaa gccaagagaa tcaaaactgt 41280 

ctgtaactcc agctccctgg gatccaacac ccatttctgg cctccatcag catcactcac 41340 

aggtgtgcac acatacacat caataaaaat caaaaccagg gatgaagggg tagggaggtg 41400 

catgtggatc tgggaggagc tgagaggcac tgggtgaata caataaaaaa tttggtgcat 41460 

ggtggtgcac gcctttaatc ccagcacttg ggaggcagag gcaggcgaat ttctgagttc 41520 

aagaccagcc tggtctacag agttagttcc aggacagcca ggtctacaca gagaaaccct 41580 

gtctcaaaaa aacaaaacag ccgggcggtg gtggcacacg cctttaatcc cagcacttgg 41640 

gaggcagagg caggtggatt tctgagttcg aggccagcct ggtctacaaa gtgagttcca 41700 

ggacagccag ggctacacag agaaaccctg tcttgaaata aataagcatt tgttgctgtt 41760 

acagaaaact ccagcccagt ttccagcaca cacagggtga ctcacaacat cataactcca 41820 

cttccagggg atccaatgcc ttcttctgac ctctgtgggc accaggattg catacagtgc 41880 

acagacatgc acataggcaa aacactcaca aaataaaata aatctagcaa aaaaaatttt 41940 

aactaataat ttaaagaaaa aaataaggaa gccgggggtg gtgtcgcacg cctttaatcc 42000 

tagcacttgg gagacagagg caggcggatt tctgagttcg aggccagcct ggtctacaaa 42060 

agtgagttcc aggacagcca gggctacaca gagaaaccct gtcttgaaat aaataaataa 42120 

ataaaaaata aggccaagta attcttggaa gaatcccaag gggacactaa gtgtatataa 42180 

aggcgttcca tagggctagg aatgaggctt agcgagagca acttcgctgg tgtatgaaag 42240 

tccctcagct gcatgtggta cctttaatct aggctctccc- gaagcagagg cagaaggatt 42300 

tctgtgagtt caaggccagc ctggtgtaca tagctagttc caggacagaa agggcgatat 42360 

aatagaaaca tcntacctag agcccngcca aanaaagggg agacctgaga ccagagagat 42420 

gactcagtgg ctaagagcat tgactgctct tccagaagtc ntgagttcaa ttcccagcta 42480 

aaaatttatt taaatgttta ttacttgtat tattatttaa atttaaataa ataagtaaat 42540 

gggagcctag gtttgagtcc ccaaatcacc aagaaaaaat gttatcattg ctaataatca 42600 

aattaagagc ataagaactt ctttttaaag aattcttatt tattttatgt atgtaagaac 42660 

actgtagctg tcttcagaca caccagaaga gggcattgga tcccattaca gatggttgtg 42720 

agccaccatg tagttgctgg gaattgacct caggacctct agaagagcag tctgtgctct 42780 

taagtactga gccatctcta cagctcttat caggttgata aaatttaatc tcgtggagcg 42840 

ctgagaccaa gaactaaagc tgggagattg aaaaatgcag accaccaagg ccctgctcat 42900 

ttctccagtt ctgatcagct cccgtaccag gggtctaacc aggcctgtgt ctgcttccct 42960 

gagtagacca gaggccccat ctaaacagcc tgcctgcagc agctcctctc tctaggtgga 43020 

cagatgggaa tttcagacca atgtcatttc ccaggacatc aacacagcag ccaaatttat 43080 

tggtgctgtg gctgccacag ttggtgtggc aggatcaggg gctggcattg gcacagtgct 43140 

tgattattgg ctatgccagg aaccagtctc tcaagcagca gctcttctcc tatgccatgc 43200 

tggggtttgc cctgtctgag gccatgggac tcttctgttt gatggtcgcc ttcctcatcc 43260 

tcttcgccat gtgaggctcc ctggggtcac ccagccgtcc ctgctgcctt gactccatgc 43320 

cagtcctggt gctggagtct actgagattt accattaaac agcaacgttt ctctaaaata 43380 

ctattaatta attaattaat cacgtgacaa ccccagcgtc catatgggtg tggaaaatga 43440 

ggaactctac ccatcataca tggcgactat gaagaacaat gtgacagaaa atgctaacat 43500 

catgtgtgac cgcatgcatc agccctgact gctaaaagtg gacaagcccg aagcgaaagc 43560 

ccaatgttct acttctaaat gcatgcacca aacgcctccc acaggaccag aggtgcagct 43620 

ctgatagggt ccttgcctgg catgcatgaa gctgtggaca cgaggcatta ttcgcaagaa 43680 

cattctagct gtctggaggg ccctcaatcc actgtgttcg cgctgttcca gcaccagtgc 43740 

ctcctggggc tgcacctgaa aaaggggact gcttaagagg gctcctacca agcctactgc 43800 

cacagatgca tgatgggaaa gccttctgga agcaactggc tgccaaaggc tctggacaag 43860 

agatcaccct ctactggaaa ggtggtttca gtctaggttc tgtgggattc caggaaatta 43920 

gacaacactg gcagtccaac agacagacga tctaaacttc caaggcacag ctggtagaac 43980 

ttgctgcgga accagacaac aaggtacgag ctactcccat acaacataca aaaaagcaga 44040 

gagagagtca gagacagaga cacacagaga gagacagaga gagagtctaa agagagtcag 44100 

ggtctcagga ctgagggtat agtctactgt agagcatttc cccagcatat acaagaccct 44160 

ggattcaatc tgaacacagg aaaaaagggg gggggggact tcgattatct caaattctcc 44220 

tttttgtgac acacccctaa agtcactgcc tacttccctc accgccatga agtaaagagc 44280 

tgtttgcgct tatgtctaca cagtctcggc tcccacttcc tcctcccctc tgcttctgtg 44340 

ctcatctcct ctgaaaccac tgcagcaagt gacttgtgtt gactgccaca cggaaactct 44400 

cctcagtagc aggcagcaga gcagagctct gtcttctcgg agcttcttct ctcttgtcgc 44460 

cattttctcc cacccttaag taccctatct tctctgtctc tgcttgttga tccttggacc 44520 
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cttttccttt ctatgaacaa aatatctcct taaaggatct cttctagttc agggtccccc 44580 

cgcccccact gtggagaaaa cccagggcct tgcacatgct cagcaggagc tccatccagt 44640 

ctctagctcc atgacttaaa gcatctctgt gctgtcaaat atacacttcc agcccttacc 44700 

aaaatattca gtcaactcct tgccattcaa aatggatgac ctcaaagcca gagtcagcgg 44760 

tgctatgact cccagatcca tccacttggt agcccaggaa tgaactcann nnnnnnnnnn 44820 

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 44880 

nnnnnnnnnn nnnnnnnnnn nnnnnnnngg ttactgggat ttgaactcag gacctttgga 44 94 0 

agagcagcca gtgttcttaa ctgctgagcc atctctccag ccaccaccac caccaccacc 45000 

accaccacca ccaccaccac caccccacca ccaccccgcc ccacctgctc attcctgatt 45060 

ggttggttag tttagtctgt gagacaggag ctgtcccttt tctatagtgg aaggtgaata 45120 

agaaactcct gaaagtgaag gcctacaaaa cagccacact tatttgttgg aaaatactgt 45180 

aaatgtgaca tgtaaataca tgctaaaata attcgttaag tcagtgaaca accttaaaac 45240 

acagtctgta gcctgaatta cagacacgac acgagccatg acagaggctg aaataagacg 45300 

cctttgcaag gagaagggca gaagcttcca tccttgctag caaatctttg ttccaagctt 45360 

tatcagattt tattgctttc tctttctgtt tttctttatc atatttgttt atttgttggg 45420 

ggaaagccta atcttcatag cccatgtatg tgagcacttc agcatatgtg tgtgaacacc 45480 

aacagcacac gtgtatgagc accacagcat aggtgtgtga gtaccacagc acatgtgtgt 45540 

gagtacgata gcacgtgtat agagttcaga ggagaactga gagagtccgt cttttcctcc 45600 

tactgtgtag gtctcagggg tggaacttgg gctcagcctt ggtggcaagc tcctttatcc 45660 

acagagtcat cctgccagcc cagctttctc tttttctctc tgttatgtct atccactctg 45720 

ttcaaggcta actcactgac tctgagttat cagaactgct tgtgagagca ggagtaactt 45780 

tggacatctg tgctggtagg aacaccatcc ccactcggct tggatgacga aggggaaaaa 45840 

aagcatcacc aaggagttcc accacctcaa ccagcaaata tttacctcct atacatggat 45900 

aggtggggtg ggtgagcctt gtgatttatc gttaggatct catgggagtg attacagctg 45960 

gtctactcca tgaccaaaat ggtgacggtg gctgaccaaa aagaaacagc tacacctggc 46020 

tctagttttc tttctttctt ttttcttttt cttttacccc acggtactaa ggattgaacc 46080 

caggaatgca agagctctgc caagtgagct acattcccag- atctgttttt ccatttcttt 46140 

ctttcctttt agattttatt tcgatttatt tgtctatgtt tgtgtgtatg tgtgtatgct 46200 

tatgtgtacg tatcagtatg ccatgggtat acagaaacct' gagaaggcca gaagagagtg 46260 

tctgggttac aggagtttcg agctgtcctg tgggttctca aatgcagcag caccaccacc 46320 

aatcaccccc acccccaccc agccttcgag ttcaattctc atcatcacaa aaacacacac 46380 

acagacaagg gcctgcaaga tggctcagca ggtaacgaag ctcgtgtcat aagcctgaga 46440 

acctgagttc actgtctgga acccgcgtaa agggggaagg gaagaatcaa ctctatgatg 46500 

ttgtcctctg ctctccccat gtgtgccatg gaatgaacag ccctcccaaa cacacatcaa 46560 

gaataaataa aactaaaatt agcttagtaa cttttatgtt gaaagtggtt tttacatgcg 46620 

tgggcaacaa taacaccgag agtagaaagg caagcatgta tgtcactgaa cagcattgaa 466QO 

gaaaaaacaa acacatttcc tgtacatcgt tctgggagtc tgagttaggg tttctatgct 46740 

gggataaaaa caccctgaca aaaattaacc tggggaggaa gctgtttatt tcagctttta 46800 

tgtctacaac atgacctgtc acccagggaa gtcagggcag aaattcaaac aggtcagaag 46860 

cctggaggta gaagctgata cagaagccat ggagctgctt ctggcttgct ctagcacaca 46920 

ggagtactag cctaggggct gtactgccca cagtgggcta ggctctccca cagcaatcat 46980 

aaatttagaa aatgcactac aggtttgcac acaggccaat ctggtagggc cattttctca 47040 

attgaggttc cttcttccaa aaggacttta gcttgcatta tgttgacata aaaactagcc 47100 

agcatattgg gattatagat attctcataa aaaaaagaca tttagattcc cacataacac 47160 

catattcaga aattaactca atgtgaacca gaagctctga aagtaagagt taaaactatg 47220 

aaaaattctt acaaccatcc ataacaaaaa tctgatgccc tcttctggag tgtctgaaga 47280 

cagctacagt gtacacacat ataaataaat aaataaatat ttaaaaaaat atatgaaaaa 47340 

tcaggctggt gagatggctc agtgggtaag agcacccgac tgctctttcg aaagtccaga 47400 

gttcaaatcc cagcaaccac atggtggctc agaaccatcc gtaataagat ctgactccct 47460 

cttctggagt gtctgaagac agctacagtg tacttacata taataaataa ataaatctta 47520 

aaaaaaaaaa aaactatgaa gaactatgaa ctacaagaag tcaggaatag ggctgggggt 47580 

gtaacccaac agaaaaacac ttgcctggcc tgcgtttggt ctctagcacc accaacgtag 47640 

aaagagaaca gcagaggatg agggcatcct gacttgagtc aagtgacaag tgataatcct 47700 

cgagacacca aaatcacaat gataaaagag atcaacaagt tgggctttat ctgaataaag 47760 

agctgtgtcg ttaaatacca cgcaggaagt gaagaggagc tgagtctggt aacacaggcc 47820 

tgaaatccaa gctactgggt ggactgaggg aggacaacag ctagctcaag gcccacctgg 47880 

acgccagagt taactcagag agcagcttgg gtagctttaa tgagactctg cccaggccag 47940 

tgcacaggag agatggctca gtggttaaga gcattactcc tcttgcaaag gacctgagtt 48000 

caattcccag cacccacgtg ggcacttaca atcatccata actttagttt caggggatcc 48060 

aatgcccttt tcacagtacc aggcatgtac acagtgcaat tacatacata catgcatgca 48120 

tgcatacata cacaggcaaa acttacataa aatactaagc agataaatct taaaagaagc 48180 

cgggcgtggt ggcgcatgct tttaatccca gcacttggga ggcagaggca ggtgtatttc 48240 

tgagttcgag gtcagcatgg tctacagagt gagttccagg acagccagga ctacacagag 48300 

aaaccctgtc ttgaagaaaa taaaaaaaaa aaagaaaaaa atcttaaaag aaaaggagag 48360 
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gactggagag atggctccac agttaagaac acttgttctg aggtctacag agtgagttcc 48420 

aggacagcca ggactataca gagaaaccct gtttcgaaaa accaaaacca aaacaacaac 48480 

aacaacaaca acaaaaccac ttgttcttac agaggacttt ggtttgattc tcagaatcca 48540 

catgatggtt cacaaccatc agttgcaggg atccaaggtc ctgtcttctg tgggcaccag 48600 

gcatatatgt ggtgtacata catgtataca ctcatataca taaaataaaa agttttaaaa 48660 

aggaggctgg gtttgtagcg cagaggtaga ggtaaaaaga ctctagcttg tttaatgttg 48720 

acatgaaaaa aaaaagacat ttagattcct gcatcacacc atatccaaaa attaactcaa 48780 

tgtgaatcat aagctctgaa agtaagaata agcctagtat gcactgtaag gctctgggtt 48840 

cactccccag cactgcaaaa gatcatgaaa ccagaaatgc agatcctctg aaccacagca 48900 

tgggaatgta actcagccga tgcagtgctc acctgtcgta tacagagcac aggataaatt 48960 

gattgtggtg gtgcatacct ataagctcac tacgtggaaa gtagaggcag gacgaccaaa 49020 

ggttcagtga catccttggt cacatagaga atttgaggcc agtctggtct gctggtctat 49080 

ttggaatgct gtctcaataa ataaaagaaa gaaagaaaaa gaaaagaaga agtcctatga 4914 0 

ttgtcttaac ctctgacctc tgtgttcatc aagtctcctc ctcaggaact cactggtcat 49200 

cttgtgaaaa cctaccccag agtctctgtt cagaggaccc aggctccagc tgtggttacc 49260 

acataggatt tttatactag aaaaataaaa tgaataagta tgtatttttt aaaaaggtgc 49320 

agagctggat atggtggtgt ctagttatag catccagaac tgagacagga tagccatgag 49380 

gttgagaaca gctagactat acggtctcaa caaacaaaag taagggatct gagtagatga 49440 

ggttttaatt tttttctttg tgtttgttac ctaacgtgta tggttgtttt gaatacatgc 49500 

atgtctgtgt atcacttgtg tgcctgaaac ccaaggaagc cagaggaggg catcgggtcc 49560 

cccggaagta ttattacaga aggttgtgag cagccatgtg ggtgctggga atcaaatctg 49620 

aaagagccac ctcgggctgg agagatggct cagtggttaa gagcactcaa tggctgctct 49680 

tccagaggtt tggagatcaa atcccagcaa ctacatggtg gctcacaacc atatgtaatg 49740 

ggatccgatg ccctcttctg gtgtgtctga agacagctaa agtgtactca aataaataga 49800 

tcaaaaaaga aaaaagaaac agccacctct ccactctccc tttttaaaat cctcttgcct 49860 

ctgtccctta atgttaataa cacaggtata tgatactatg ccttgtttat gaatagaaaa 49920 

tacacgtgct aaagcaagtg tgaaccttaa atacattatg- ctgagtaaaa ggagtgagtt 49980 

gcacacaaga cttttctgct caagagtatc tgtatgaagt attgaacatg tgaactctga 50040 

aatcgggagc tgaggaagat atggggagtt ctaatggcta caacatttct ttttggaatg 50100 

atgaggatgt tctagaactc aaaaatggtg ataactcagc atatatacta aaactcattg 50160 

aattgtacac tttaaatgaa tgcaataaaa cttgtctcag taatgtggtt tagaagatgt 50220 

acagacatgt gtgtgtgtgt gttaaaacat ttcttggcat ggcaataaaa atacagtttt 50280 

agccaggtgg ttgtggctca aaaaataatg ataataacaa taataaaaat aatgaaaaca 50340 

gaggctggag agatggctca gcggttaaga acactgactg ctcttccaga ggtcctgagt 50400 

tcagttccca gtaaccacat ggtggttcac agacatctgt aatgggatct gatgccctct 50460 

tctgatgtgt gtctggaaac agctacagtg aaagtcattg caaggacttt acaatagtga 50520 

ccatgataac attgaagcta gacttgctac tactgctgag tgtgtctgct ggctctttct 50580 
aaggagtaat gttagctttt tgtcctaaat ttgtttcctt cctttcctct ctccctctgc . 50640 

tgttttttct tacccctctt ttactttgct ttcccctctc atctcctctc ttaacagagt 50700 

tgtcctatgc agcccaaatg ccatcttcct gcctcagcct ccccagtgtt gaaaaatact 50760 

ctttccacag gttatgttag gagactggag tctgctcagt cggggaggga gcctgggtca 50820 

agttctgagc tcaattcctt ttctttcttt ctttctctct ttctttcttt ctttctctct 50880 

ttctttcttt ctttctttct ttctttcttt ctttctttct ttctttcttt taagacaggg 50940 

tttctttgta taccctggct gtcctggaac tcactttgta gctggcctgg aattcagaaa 51000 

tctgcctgcc tctgcctccc aagtgctggg attaaaggtg tgcaccacca ctgcccagcc 51060 

ctgggctcaa ttcttaacat tgtggagaga aaagtattgt agctgttctg gccacctgga 51120 

attactttgt ttctgatctt ttgctgcagt caaatccttc tcatccatct ttcctcgtca 51180 

ggctataata tagactctcc ttgcaatact tggaaatgct ctacagtcag ctacatcctc 51240 

agtcctgctc ctatattttt tcctaagctt ccttctaagg tctttattgg tttatgattt 51300 

acacagaaca tttttttttc ttgtctatag catgcgttag agtgatcgtt gccagataga 51360 

ggaaagagaa atgagagaan nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 51420 

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnt 51480 

cagctactga ttcctcctcc tccctcctcc ttcctccctc ctccccagcc tcatgctctg 51540 

ctcatcttgg acttctgcgc atgtcctcag cccagacctt ctgctcttgc ttctcctctc 51600 

cccagcagcc ccccagttct cttcctgaaa cttctgaggt actctccatc acctcctttg 51660 

gctcctgctc tgattggtgt cacctgctgg ataggcttgc tcctgactcc actgttcgtg 51720 

tctcaattag ggaccctcac cctctgatat accacacatt tccctagtgt ctccacctcc 51780 

cacccccacc ctatacgcac atacacactt agctgcatca ggatcctaca ccagggactt 51840 

cttacccttc taatcctccc caccggacac tgcccaggga cactggggct ccagagggct 51900 

attgccacac ggacacacag gagatctcat caaggagatg tgcctacccc agagggtagc 51960 

tctcaccatt cacaagcaca ccacttctgc ctccagcttc tactctctcg caggaagtag 52020 

ccagcccggt gccaagtatc cccaactaca tccccaaaat tctcagacac tgccagcctc 52080 

cagctgtcag cctggccccg gctggcgggc gcctgctcct ggcatagcga ctagggtgta 52140 

attagaaacc cgctagctcc ctaattgcca gttctgagct gtccttgtta ccggctgccc 52200 
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gaggcacaca tagaggaaaa ggctgagagc tgagccaggc tggcatggag gtagccctag 52260 

tagacctaga gaggactggc atgtggccag ggaccaaacg tggcacagag agggctcagt 52320 

gcaatctgcc ccgtgggtgc ctcccagcca catccatttg cccagaactg tgacgtcaaa 52380 

ccagcccggc ccattcattc tttattcagg tggcataaaa atcactacaa aaactttaca 52440 

aaagagtctt gggagctaaa gggtcccttc cttgcctcag tccccaagat tcctggcagg 52500 

ggaggacaag agagagaaga aggaggaaga ctcctggcag tgttggcatc tccaaatacc 52560 

agaggggtga cttgggtgac aggacacagg ttggggacct gaatgtcttc agcaagggac 52620 

actcttgtag ggtaggtcag cctccaacca tgaagtataa caccaaggcc agtctaagct 52680 

tgggagacca acacttgtct ctccttttcc cacccagggt gtctggaata tgtctaaaga 52740 

tggcctctcc agcctctgct tacaaatgtg gagggaccct aagttaggga cttgcctaac 52800 

ctacctctag ccaaaactgt gtccacaagt gccagcccac aaaagatcac cccctgagcc 528 60 

ccttgggaag aaatgaagat tccccatgcc tgccttcctc caggccccac cccacctgct 52920 

gcaagagaac agcttctaca ctggtgatgg tccttccggt cccaccctat cccacaaagc 52980 

tggttagaaa gagtcacagg agctgagagg ctgatccagg tggggactca ggatgctgct 53040 

gcccagggcc cctcctcact tgggggagct gaactggggg tagtcttcct ccatgcgggg 53100 

tgcaagtttc aagtcaggac caaaggtctt gcctccatgg aagtcagctt tgtcattctg 53160 

gcctatgagc ctgttgtcag gggaatctcg ctgttcctgg agctggggca gcgcgctggg 53220 

gttagggttc ctcacactgc ccacaaagag gggcacgcct atggtgtcct ccatgatgaa 53280 

gaagaggaag ggtcggttca cagtgaagga ggagagggac attcgattca tggctacgct 53340 

ggtagctgcg gctgcctcca caccagcctc gctgagctcc atggtagact gatgttgcac 53400 

gctagacacc accagattct gctcagagat cccacgaagg tctgggccct ggaacaattc 53460 

ctgcaggcct gcccagaaca gcagatgact ggtcagtgct gccccaaggc tatgtggatc 53520 

tgtctagcat cctggctaaa gggaacactt gaacccagcg gttgattgga atctgttaga 53580 

cctcagtcta gacaacactt ctagaaacct tttttttttt tttttttttt ttttaaatca 53640 

ggatctgcgc taggtacagg acagaaagtc tagaggagca tatcaaatgc tcccatccag 53700 

gaagcagggc cacctctggc tcaggcacac tggcagctcc cgtactctgc ccagaccacc 53760 

taggggcacc ctatccccaa gctccttacc cagttggctg- agggtggcca ccaggtccag 53820 

ctgctgttgc agatggagtt taggcagcca caccttggtg ggcctctcct gcagcgaggg 53880 

atggtacaga gtatcccagg tcaggttggc tagtacctcg gacacgttcc actcaaaata 53940 

agtgggcatc acgaccacaa agctcatgtt gttcttaaag gggaaatgag ccacctacag 54 000 

ataagaaaag gagagaacat gaggaccaga cagcacctgg acctgtctgg agtctgggcc 54060 

aaaattactt ctgtactttt gagacaagag ccagaaattc agggttagca tgctttcact 54120 

taactggtga agtggaataa taccacttac ccctttgcaa ggtgacatgg gaccaaatga 54180 

gataatgctt ttacacctct ctgtgtgcac acataagcat atatgtttgt atcggtgtga 54240 

gtgtgtttgc tcatgggtat atggagtcag aagtaggtaa acatcagtcg tcttcctaca 54300 

ttgctctcca cttttttttt tttttttttg gtgttgccat ctttttgttg ttgttatttc 54360 

aagacaggct ttctctgtgt agccctggct gtcctggaac tcactctgta aatcaggctg 54420 

gcctcgaact tgcagagacc cacctgcctc tgcctcctga gtgctgggat ctaagatgtg 54480 

tgtaactaca catagctccc tcttttttgg acacagggtc tcatggatcc caagctggct 54540 

ttgaaatgac tgtttggggc tggagagatg gctcagcggc taagaacact gactgctctt 54600 

ccaaaggtcc tgagttcaaa tcccagcaac cacatggtgg ctcacaacca tccgtaacaa 54660 

gatctgactc cctcttctgg agtgtctgaa gacagctaga gtgtacttac atgtaataaa 54720 

taaattaatc ttttttaaaa agagaaagaa atgatggcta catacttctc tctcgtctct 54780 

ctgccccaag tgctgggatt acagagctgt acaacaagcc caagtttgtt gtgttttaga 54840 

catgctaatg tatcccaggc tgtcctcaga ctctctatgt aattcagaac gaccttgaac 54900 

ttcttttaag gtttattttt atcttatgtg tatgggtatt ttgcctgagc atttgtctgt 54960 

gtaccgtgtc cttgcagtac cctcacagtc cagaggaggg caccatttcc ccctgaactg 55020 

gttgtgagct gcatggtggg tgctgggaat caaaccctgg tcctctgcaa gagaagccag 55080 

taagtactct taactgctga gccacttctc caccttgagc ttttcttcct cctatctcga 55140 

tctaaaagta ctagggatgg cggatgtgcg ttcatgtgcc tggtttatgt gttgctaagg 55200 

gttgaacaaa gggctttgtg catgccaggc aagcactcaa caactgagct acacatcccg 55260 

acagactttg actcttctag tagtagtgtc tccactacag cctgagttct ctatctgctg 55320 

tcagcaagct gtacaaacaa gctatgggcc ttcctgtcct tgcctctcag ttctctccgc 55380 

aggtggggct actggctttc aaaatgaccc atagaggagc cacagcaaac agtaggaagc 55440 

ttgcccctcg tctttcaccc tctcccagag agtcagctat aattcgagtt tttttttcct 55500 

ctctctctct ttaaacagga tctggttatg tggccctaac tatcttcaac ttcagtcttc 55560 

ctgcttcaac cttctgagtg ctgggattat ggtgtaagcc accacactca gctcacacaa 55620 

cctttttttt tttttttttt tttaaagaat ccatgcagtt aggacagcat ggaaatgacc 55680 

aggctcaggc ctccctgggt accagcataa tgcctgcagg cgggtcctct gccagtgggg 55740 

ggatggaaag atggagccag aggatctttc ctctctgaac ctcaatgtcc cacagtgaga 55800 

cactcatgtc cactgggaga tactgtagta ttcaaggaag aagcaacagg aaggtgagag 55860 

ctaagtggag ctgagcaggc tcgtatcctc tcaccacggg ctacagagaa gtctggctgc 55920 

cccctccaca tggctcctcc ctgcagaact ggcaatgctg ggcccggctt gcccagtcaa 55980 

actaaccaac agaatggatg agcatgtgtg gtgccacaca cctgggaccc cagcactcag 56040 
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acagctgggg cagaagggtc atgagtccaa agcgaacttg tgtaacattg tcagaccctc 56100 

gaacaaacaa aactagcccg tcctgttatc tcagccacag atgatgggcc caaggatcag 56160 

tactctagcc aaggagtcac ggttaggcta gaagcaaggg aagccttagc tgagacagct 56220 

tggcacggag cttcatccaa tcagaatgtt cagagcaata agctttgaaa cccgacttcc 56280 

atctatgaag cactgtgtgg gaactcctct cttcccttac gagcagggcc ctggtcctct 56340 

tgggctccgc taaaacccca gcacagagaa cagttacctg gcacgtgaca aaaactcaat 56400 

atattttctt tgaggagatg aacctcaaag aagctgtgtc ctggatagac acagcataat 56460 

aaacccttca ggagctacct acccagggac cagactttac ctcccagtac caggcctcgt 56520 

ttgccagcca aaggcaaagt ccagactgac ctgtatctca ggttgctcca gcaggaacca 56580 

tcgaagagga tatgacaccg cgtgcatcat gtccaccgac actgtgaacc gctcatccag 56640 

gtggaagaaa tctttctggg tgaggctcgg gtcaaacttg gtcctccaga aacctgcagc 56700 

caggcagagg gcaggagcca tgtaacataa aatcagcctn ctgcctgtct tgcctagaac 56760 

ctatnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 56820 

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnaaccaa ggcaggtctt 56880 

ggaaaaagga atcttaaatt agaagatgcc ttgataagat tggcatgtag gtatgtctca 56940 

ctaatgattg atgtggaaag tcacgaggga tggtgtcacc ctgggcagat ggcctggggt 57000 

atataaaaac acaggctgaa caaaccacaa agcagtagtc ctcaatggct tctgctttag 57060 

tttctgtctc aggttcctac cttgacttcc ctcagtgaag gcatgtcaca tgagagttgt 57120 

aagaggaaat aaaccctttc ctccccacat agtttttggt tatgatgtta tatgtcaaca 57180 

acagaaacta taactaatat agttggtttt ctttttttgt ttgttttgtt ttgttttgag 57240 

acagggtttc tctgtatggc cctggctgtc ctggaactca ctttgtagac caggctggcc 57300 

tcgaactcag aaatccacct gcctctgcct ctgcctccca agtggtggga ttaaaggcat 57360 

gcgccaccat tgcctggctg gttttctttt ttttttaata catttataat gcattttaga 57420 

tttaaaaaaa aaaatggcca tggcatataa tataaaaaga agtgcttaca aatcaccatg 57480 

tgcccttgcc ataaattatg taaaaatttc catatggaca tcagtctcaa gcttacaatc 57540 

tcagcactca tgagcctgag gcagaggcag gaggatggtg agctcaaggc cagcttagtc 57600 

tacataacaa gatcctgtcc aaataataac' aacagtaata atttcataca tagaactaga 57660 

aggggccact gcaaagacag tatgacaaaa ccactggccc tgcctaattg tattttaaat 57720 

aactgtcctc ctctctgtaa ttttcagttt ctaattttta' cataactacc atgtattctt 57780 

tttgtaattt taattagttt tttaataata gaaacaagct aagtgctaag aatattttca 57840 

tatgaacatt ttcaaggcac ttgatacata cctcagattt gccctccagg tgagcagtac 57900 

caattacgtg ccaccagcaa tgttagcttc cttttttccc taccatctga ttctgtttca 57960 

gtctattcgt agttctgatc ttgttatatc cctttttatt gtttccctgg gttccaacac 58020 

ctcccagttg agtgttctca ttgaatttca ttagcagctg tttcattaat ggcacagaag 58080 

aaggattaca gtgttaacta ggatagactt tgacaaagaa ctatgagaac atatcttatt 58140 

atctttgcat aaattctttt taatcaaagt tcctcaaaag cctctctctg ttcccatctc 58200 

agggagtagg tctggccact gatgagtgtc caggccacag tacaggtgtg cgtggttctg 58260 

tccctgtggg aagggcacat ctgtgttgta acaggattcc tgtcttaaca agccttgctc 58320 

aggctctaag tggtcctgag ctagctaact gcccttggct ttcccttgat taccagataa 58380 

ctattcactc ttctcatttt gcagagcact taccaggtag ctatgtcctg gaagtacgaa 58440 

tgagtccttc tattgttttt cttttactta aatcccattt gaaatgcgcc agggacactt 58500 

caatccaagg tacacttttg ctaaagaatc actcattttt atatgcaaaa tgtcacctat 58560 

taactgcagc tgatatggta catacatatt ctctcttcct attatccact aataggtgac 58620 

taatgcgaaa tattgagtaa tttttaaaaa tcaatactca attttttaga aataattaga 58680 

gagacattca actctgacac cagcacccta ctcagttcct gagccttcct ctgccggagg 58740 

agaatctata aataactcac gaagctgaca ttactcactg tgttgcagtc atttttttct 58800 

gagaaaattt tagcaactgt tctaatagag cctgccagtt atcagtagtt gagaatgcaa 58860 

gtcaactttt aattatgcag acgctgatta ttcagacgac aaattgttgg tgcctgcacg 58920 

gctccttcct gctgcctacc tttaaccgtt ctcagtgctc attagcacat gttccagaag 58980 

gtaggctttg gaggggcgga caggcactca aaccagctaa gcacttagag aagctctgat 59040 

gaaagatgtt aatgcagttt gtagaattat tgactaaaat tgagtcattt ggattccctg 59100 

tgaattgtat ttacatgccc tgtccctgtc ccccatagca acagataata ggattgtctg 59160 

cagagagaca acatagttct tatatttaat tttttccttt gtcgaacatt ttcacatgat 59220 

ggttcgtggt gtttcctttg ttcattacat ttgtatccag actagttact tctgataagc 59280 

ggttagttag gattcctggc acgcggacag tgacaccaca gttgtctgat cgtttcccac 59340 

ttttttacaa aaccgtttgc ctttaagagt cagtgttttg cacatttcac ccagattatt 59400 

ggaaatatta tttccctcct gcttaaaccg aagctgtgat cataatttaa gcctttctag 59460 

gtagccgatc ttacatgtat catacctatt tctggcatat gtttgtctat tacaaagacc 59520 

tcgtaggtat gcagttagaa gcctctagtt aaatgaaatg ttgcgtgtgt gatgaacctg 59580 

gagtggggat ggccttttgt gtgccccaag gctgttgtgt ttcacacagt tgttttctgc 59640 

ctcctctggt ctatcactat cctgccactg ccagaaaacc ctgctgtgtg ttccccgcgt 59700 

ggaggatctc tgcttctgaa cttctttggc ctgagaaact ccataaccaa atcagttagc 59760 

attttgttta aagagcaggt aggctgttag agcttgggtc ttacatgtct cccaggtcca 59820 

cttgccagcg ccttgaccac tgttaacttt tgttaaccaa ctcatctttt gctgcctgtt 59880 
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ttttgggggg tttttttggt tttgtttaag ccaagatcag ttatatggcc caggctgagc 5994 0 

ctctcttccc agcc'tctcaa atgttagaat tacaagcatg catccctcag catacctttc 60000 

ctttgctttt tttaaaatag agttttgcca tagcaacaga aatctaacct aactaagcat 60060 

agccgtgcac atggtatgag gaactcacat atgtgtgaat ggaagttcat agagaccggc 60120 

atcactgcct agaggcccct ttcttccttc cttgcagttg tcgtgctagc tgactgtact 60180 

acaaaagagg ttgtctgagg cataagacta ccttcaataa aacatgcaca gacagtttgc 60240 

ttctctgaga tttcagagca gtgactacct tcaataaaac atggacagac ggtttgctta 60300 

cctgagactg cagagcagtt tccaaaaatt ttagacaaag ggtaggatga agaaggctgc 60360 

ggggttttgc acacacttaa ggtgcgtaag taaataaact gagctacact gacaggatgc 60420 

tcgttctagt agccaaccaa agagcagttg aaccaaagca cctagacttc aaacatcgtg 60480 

gggagataat cttaggagtg ctatgcttct gcgtcctaca agtattatga aactgtctag 60540 

aaagcacccc actggtaatc cctttttgat tatttttttt ataaattcta gtcttggggt 60600 

tttgagtggc acacagacat aatggttagg cttcggtgtg tgctcattca ctttgcttcc 60660 

tggggaccag agtttgcgat gagtcatgtt ccatctgatt tctgtcggat ccggctgcag 60720 

agccatgact cagatgggct tcaggcccag ctgctcagtt catcttctgg ggaatagatg 60780 

acaaggacgg gacaaatgtc ctgacgcaca tttccttctg ttcttgcact tccagggtct 60840 

aacgagagca tcattaccaa cagcaggcag atacgccttg ccacaggcat cttccctgtt 60900 

gtcagcctcc tgaaccactc ctgcaggccc aacaccagtg tgtccttcac tggcactgtc 60960 

gccaccgtcc gggcagcaca gaggatcgca aaaggacagg agattctgca ctgctatggt 61020 

gagccagcct ttctttccac taccctgctg tgcctcacac ctcacatgaa aaggataagg 61080 

ggacaggaat cagcagatat gggcccagtg cctctactca tcctctgagt ctttcctgga 61140 

aagggcaatg catccttggg ccaataaaaa aggtcttctg gctgtaataa aaaagcccgt 61200 

tgagggcagt gagccatatc cctccatgcc ttgtagacag cctatcctga aaatgagcga 61260 

ggagcacttt cttggcttct ttcttcctgc cccagcagct tggaaacgta tccactttca 61320 

cccgtgtttt gttgtttttt ctgagatgat agggcagagt acccaacctc atataggcta 61380 

ggctagtgtc tatcactgag ccaggacccc aacccagcac caccatgcca gtcacgtgat 61440 

gactaggcca gcccctcggt agagtaggca ttgactctct- tggtgtgact aggaactgtg 61500 

ggtaatctct ctccagggcc tcacgagagc cggatgggcg ttgctgagag gcagcagagg 61560 

ctgagttctc agtacttctt tgactgccgc tgtggggcct' gtcacgctga gacactgaga 61620 

gcagctgcag ctcccagatg ggaagccttc tgttgtaaga cttgcagagc gctcatgcag 61680 

gtaaatctct gctgttccca ggggcagggc tccagctaaa ggttgtcagt cgccaggaga 61740 

accattcctg cttcccttct tgtaactcct ccctacatgt cgcccggtcc tgcagaaaac 61800 

acaggttgta tttcctaata ttttccctat aagtgacaca aaatcttaaa ttacacaaag 61860 

ggaccaaaaa aaaaaaaaaa aaaaaagccc tagaaattta cttgctcaaa taagtcatca 61920 

aaagttgtgc atcaggccta gcacttgggt actggtaacc ctagcactca ggaggctgag 61980 

gaagaaggat ctcaagtcgg aggccagtct caagtgacac cccatctaag agatcaccat 62040 

tccaaggagc tatttcagag atggtttaat ctggggaccc agattgtgga ttttctgtct 62100 

gttcaattcc atctctctgt gctggcctca tcagacacac tctgtagtaa ctgtgggaaa 62160 

atccgaccca catagttttc cctcagcctt tgacccagag ggaagagcca cagtggagag 62220 

catgagagca gacccttggg tgctactgcc aggtaatggt gtagacactg gagtcttcaa 62280 

cattcatgcc ccaatgcaaa atggtctcca caccagagca tggcattctc attagaaata 62340 

agtaaatgga attggctgtg ttgaaaattg taaagccaag ggtcaagaat gaagccttcc 62400 

ccagcatgtt ttgttttgtt ttgtgtttta ggcagcgtct ctctgtgtag ccttggctcc 62460 

tgccctctgc tacctctccc aggtgtgcca ccatgctggg cctaagcgcc ctgtgcatta 62520 

gtgctccctc gatcctgctc actcttgaga cagtcttcct tctactctgt atccccagat 62580 

aacctagagt tcacttcaga gcccaggctg gcctcaaact tgagatcctc gtgtcccagc 62640 

ttctcaaatg cagtgatatt tacaggccta cacctggctt tccctgatag attcctagta 62700 

agatgattat cctttgagcc atatctctct tctgcttctt cctctcttcc tgcagggttg 62760 

atctagaatt tattctaaag ctgactggcc tcagaattgc catccttctg cctttagnnn 62820 

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 62880 

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnaca tagtgtcaac tttcaaattc 62940 

tgccttaaga gttctttgtt tatgggaatt tatgggaatg ttccacagaa cccatccagc 63000 

ggagttctgg ctgttgtttt ttaatcttta ttcatcttgc gtgtgtgtgt gtgtgtgtgt 63060 

gtgtgtatgc gcgcgtgctc aacttgcaaa attgcaaaat tcagtctcct ctttccaccc 63120 

tgtaggtcct ggggatcaga ctctgttagg cttggtggta ggtgctttac tgagccatct 63180 

tacaggcccc ccatggacaa ctttttcttt gaaaacctgt ttctggcttg ggtgtgatag 63240 

ctcacacctg tgaccctacc accactcatg aggaagaggt aggaggacta acagaattgg 63300 

aagccagcct ggactacaca gtgagtaaag gctatctata tactcaccac atggcaagac 63360 

cccgttttaa aacactgggc aaggtgaaac aaaagtcaat taatttcaca taaagtcaat 63420 

agcttcatta acggcctagt tatctttaaa actgtatgca ggttagtact tggtttcaat 63480 

tttattactt tttctctgga acatttaaaa gtactttagg ggctggagag tcagttaaga 63540 

acagtggctg ctgccaaagg actggagttc actcccaagc acccaggtgg caatcacaac 63600 

tgtctgtcat ctaattctag gggatctgac accctcacag actcacaggc agtggaacac 63660 

caatgtacat aaaataataa ttaaaaaaat gaaataaaat accaggcaag gtggcacacg 63720 
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cctttaaccc cagcactcag gaggcagagg caggcagatt tctgaattcg aaggcagcct 63780 

ggtctacaga gtgagttcca ggacagccag ggctatacag agaaaccctg tctcaaaaaa 63840 

aaaaaaaaaa aaggacttta aattgggctg gagagatgga ttaaaagcat tggctgctct 63900 

tcccagaggt cctgggttca attcccagca ctcaaatggt ggctcacaac tgtctatatc 63960 

aacgcaatct aacaccctct tcaggcatgc aggttacatg tagacaaaac atccatatgc 64020 

ataaaataca taagtaaatg agtcttttaa tgtatactag aagctgggtg gtggtgcatg 64080 

cctttaatcc cagcacttgg gaggcagagg caggtggatc tctgagttgg aggccagcct 64140 

ggtctgagta aatagagcct tgtacttcta cttatcacta cagttacatt ttataacttt 64200 

gggccctagt gcttccattt tccactgttt gcttaaccac tggggcctga agcttttgtg 64260 

ctgacacttt tgttcgctaa tcatcaggca accaatggtc tctacactcc atcaccatca 64320 

acacaaacaa aacaaaacac aacactacgg atcctggcat ggtggaacat ctttagcccc 64380 

agtacgtggg cttgagttca aggccggcct ggtctacata gcaagttcta ggatagtagg 64440 

gatagtcttt aaaacaaaac actattttat ttatgaacaa aacatgtaaa gaaagaaaaa 64500 

aaactgcaaa tttatctatg aatgaagtct aagtaatact tcaatattgg aaatagcttt 64560 

ctaaaatatt tttatttaaa gaaaactcag caaattattc aaacaacctt ataaacgttc 64620 

gttataaaag taaagaatta tttgcaattg ccttaagggt ccaaggtggc agcctcttaa 64680 

aattcagaac aatccaagct tcacattcca gttcaacatt tctacagccc taacgtattc 64740 

aaatacctcc attctgacaa ctgtttcccc tcttcttttc ttctaagctg cttagatgtc 64 800 

tgtcccaggc ttttcatgat tttagtcatt cacacaacta gcaaacatta tctagggact 64860 

aaaacttgcc agatactggg atatcaccct aaagggggac tgaaagtagc tgcaggctac 64920 

agtctctaca atctcctgaa tgaaatacaa agtagctaat atttaccaaa taaacatgta 64980 

cacctgtgat gattgctagc tgtactagca gaagctaaac actaaatcta gaaactcagt 65040 

cctccaacta gccccttgct cggcttcagc ctcattttta caaacaaggg aaagagtttg 65100 

gaatgttgcc caaagccata cataagtgaa caaaaaggag ttggagtctc caaatgcatg 65160 

gatttgggct agttactttg ccaaccaact cagtaacaac tgagctgaac aggaacactg 65220 

tggtagcaaa agaaactgga actatcaatg gcctctagag caaaaatata tttaaaaaga 65280 

aaaaaacaaa caaggcctgg caaggagact gtgagaagag tgtgctgact gaaattgact 65340 

agttcagcca acaaaagact attccagggc tggtgagatg gctcagtggg taagagcacc 654 00 

cgactgctct tccgaaggtc aggagttcaa atcccagcaa ccacatggtg gctcacaacc 65460 

atccgtaaca agatctgact ccctcttctg gagtgtatga agacagctac agtgtactta 65520 

catataatca ataaataaat ctttaaaaaa aaaaaagact attccagtgg ggatggaaaa 65580 

gttaagtgtg gagttaaaat atacttcaac tggtgatgga ctaggtgtcc agagtcgggc 65640 

aaaaggatgc tctgtggtag aggtgcctgc tgtgtaagcc cagctacctg agctcaatcc 65700 

acagaatcca cagcggagtg ggaagagaaa caacgtccca gagttgtcct ctggcatccg 65760 

acgcacattc gccatcccca agatgtcata catatgtgta catactacac actggcgcac 65820 

gcgcacacac actctttttt aaaattcaga cttagaggga cataaaggat ttgctctgat 65880 

atatgttcaa ttgaaaatga ctttgaagat agagggcaga tcgaaggaag ctcagcagga 65940 

aagaattaat aacatgcagg tgaagggcta taaactagtc tgcagagggc cttggctcga 66000 

caaaaaaatc tatggggttt gccggtaaaa taaggaaaaa gttgtcaaca tgaaacacag 66060 

aacactagca agagaggagt gttagcagaa agaagccaac aagctcaaac aattaggtcg 66120 

gctgaaaaat tttaaaatgt cttctgattt ggctactggg aagccactgg tgacttcggt 66180 

cagcgttttc tctctcgtga ccagagagat gtctagtagc aataatgagt taggaggatg 66240 

taaaagaagt aaaacagccg aaaacaagtc caaaaagttt ggggtgatgg agaaagggag 66300 

gaaacagagg ccgccgaaga tagacagcgg catgtttatt tgtcttgttt tcttagatgt 66360 

aaacaaacta aaaaaactcg tgagttcttc tgccagtacc gggttgcctc cagcatcctc 66420 

tgatggtctt agagaccccg ggatgctccc ccgcggccgt ataatttcct ccctgacgct 66480 

ctcccgatcg acagcggctc cctccccggg tcctctttgc accgctccaa ggccgcgctg 66540 

ctagggccat cgagcccgct cagggtcgtc tccttacctc gatggccccc tcgctcaggt 66600 

gtcccaccat ggctgcaccg ctaactcccg cgctcgcgct . cttgcaccgc ctgagcttct 66660 

ctgccggggt cccgcgggct gctcaacgat tggctagagc aactgtgcgt gccgatccgc 66720 

ccccagcgtg agcgcggtgc gaggggcggg cctagacgcc gatagccacc gcattggcta 66780 

ccgcgcggca ggcagagcac gtgactcttc cgaggccggg ttcgaggcct agtggcggga 66840 

tggcgggacg tgagggcggg gcgctgggtc gcagtgcgcc tgtgtcagcg cggtgctact 66900 

gagttgttcc cccgccagct gtcggaactt tgcccgccca gtcctttggc ggacagacag 66960 

aatggcaacc cagggaacag tcggagctct cccctggtaa ctgctgctaa atatagtcaa 67020 

agcagtgacc tgggtacttc ttcacgcagt gcgtgcccgg cgccggtgcc aggcccagag 67080 

cttggcactg tgggataaac aaggtaaatc agactcagtc tccgccctct tgagttccac 67140 

ctgagagttg tggccgcaag gaacccagcc tcaaggatgg tagacgcgat atgggccaca 67200 

catgtggagc tccagagtgg gggtcaaaaa tcaatcaggc tttcgagagg cgatgcggtt 67260 

tgaactgagt taaagtgtgt gtagaaattt gtcaggtgga ttccagtgag gatagtgatg 67320 

ttcctaaaag cccaaatggc ctatgcaaaa gtattggaga gcctggcgtg ctggctggct 67380 

ctgatctgtt tgtaatccca gcctttggga tgtagaagca gcaaaagttc aaggtcaccc 67440 

ttaacaccgt tgagttcgag gtcaacctga actaaatgag accctgaaaa atcaaaattt 67500 

gggacccagg cgtggtggca ttcgaggtaa aagcaggcag atctctgagt tcgaagccag 67560 
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ccaggctaac ataagatccg gtctcaaaaa aaaaaagtaa taaaaataaa aagggagaga 67620 

ggctatatga actgaaagaa agacctggag atcaaaacag aaaactgagc cgtctaagaa 67680 

atgaaaatat ttaacttcat agttgctgga , gtaagaagtc tggaaaactt tgggcaacta *67740 

aggtaaacag gtctagaaag actggaatag tagccatcta ctggtatttt gatctctgtt 67800 

tgtacaacca caacctacta tagtttctca aacagttcca aagaatatgt ctgggtgaat 67860 

tggtaccaca ccacagatta actctccttc agcatatcaa cagctataga aaaccccaga 67920 

agaaatgatt ttggttgcgt gtcacttggt aggatgaaat ctcgattttc tagaactatg 67980 

cattaataga aagctgaatc ttcatgttct gactttacag agctgcggca gcatggatct 68040 

accggtggat gaatggaagt cctacctact taagaagtgg gcttcactcc cgaagtctgt 68100 

gcaggacaca atttctacag cagagacttt gagcgacatc ttccttccfct cttcttccct 68160 

tcttcagtaa gtgaatggaa acttcaggga aattttggtc tggaaaatgt tctgccttgt 68220 

catttggtct gaatatctct tttttatagg agagagtagc tttatattct ttatagtatg 68280 

gggcatttag cagttactgt tggttttcac gtttctccct agtctgtgat tactagaatg 68340 

ggtaggcact aactgctttc ctcttttggc atgtgttata cttaaggaat gtagtatctt 68400 

gctgtcgtcc cagtgctgtc actcatagga tctggtgcag gttgtgtagc tgcccctaga 68460 

agctcattca gtcctaatgg ggagaaagaa ccctggcact tggttagttg agacccanaa 68520 

cttctcaagt tctnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 68580 

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnntttagtt 68640 

tccaagtgcc actttactgc aatgtgtcac cacatccaga gttctgtgtt tgtttatttg 68700 

tctgtttttg agacagggtt tctctgtgta gccctggctg tcctggaact cactctgtag 68760 

accaggctac cctcaaattc actgagatct gcctgcctct gcctccagag tgctgctgta 68820 

cactaccacc acccatccag ctttctatat tggttttcct atggcgtttt aaaatagcca 68880 

ttatacgtgt gtttatcatc taaagtcctg gtcccaaaag gagatgagag gggctgctaa 68940 

ggtgaaaagg attacaaacg cttatcaatt ctgttcaaaa attaaacctc agagtgggat 69000 

tagctgcttc tttcattaga attgctatca gaattcactc aggccttgtt tgcgtgtgtt 69060 

attgaagaag tctcttcctc atcaggtcag tgactcctta gctcaagtac atgcaatatg 69120 

cagtattgat aactgttctc gcttaggaat aaaaatagaa ctgacttcca cagggaaatg 69180 

atgtgctgag ctgtagcaac gaatcttgca caaactctgt cagcagggac cagctagtct 69240 

ctcgcctgca ggaccttcag caacaggtct gcatggccca gaagcttctc cgaaccggta 69300 

aaccaggtga gattggctcc ctcgccctag gcctcagccc ttcccttgtt tattttggta 69360 

tcaccttgcc ttactgagca gtcctcaata aatgactgag gacttgaatt taattatccc 69420 

agcaccagcc acaagatggc tatgtaggcc agtgagacca gactgtgacc agctgttact 69480 

ctggtgccct tgaaagtctt cctgatggtt taagctgtgt ctgctgcgcc agatagttct 69540 

agcagctcga gcaccagaaa ggctgtctga cttccatggg ctttgtgtgg ctccagaggt 69600 

ccaatgccat catctgattc ccagcttaag gacctaagct ccgagaaggt tgctctgccc 69660 

tcagcagcag cagcaagtcc tgagtgctgc ctgggctcgt ggtgtgactc aggagtagag 69720 

ctcggtagct agcctgagct gagagctgag agaaagaaag gactcctctc ttttcagaaa 69780 

gggatttgca gaactcgatg ttagaccctg acatggtagg aatctgtttt gactattcta 69840 

gcctagattc tgaagttgac ctttagccta gagtcaagaa aactaatgat tacaggagga 69900 

atgtagagtt ggttgttaaa tgttggttgg aaaatggatg ttagaagccc agggtaaatg 69960 

tgaggaagcc tcatctaaca cctcttttac tgaaagagaa aacataagca accaacagct 70020 

tccctggaat gcccggctgt tgactccgtg agataaagag gcattttcac tttgacctaa 70080 

ccgatagaga ccttgcaacg tggtctctcg tgtccaggac tagatctgta tctgttgtga 70140 

ggcatttttc ctttgaatcc atagagcaag ccattcagca gttgttgcgg tgccgggagg 70200 

ctgctgagag cttcttgtca gcagagcaca ccgtactggg ggaaattgaa gatggcctgg 70260 

cccaggccca tgctacctta ggtatgctac cttaggtata gccggagttc tccttccctg 70320 

ccgtgtgttc agtgcggccc ttgccttgtc tgtttggttc tctcttgcca tctgaattga 70380 

cgctcttctc cctcccattc tgcattcctt gcccccagag ccttaggcta atggtgtttc 70440 

ttttccggaa tgagacattt ctcttctcac agggaactgg ctaaagtctg ctgcccatgt 70500 

acagaagagt ctccaggtgg ttgaaactcg ccatgggcca tccagtgttg aaattggcca 70560 

tgagctcttc aaactggccc aagtcctatt caatgggtag gcctttcttt ttcctagtgt 70620 

ttggccaggg cacacagtgc tctgtgtttt cctaggtgct tctgtgtatg gctttttgct 70680 

acagtgcttt aaagcatgtt gaaactcttt tatttcctct ttaggacaga tatttgccct 70740 

ctgcttcact gatagacttt aagctttgaa ttccttcctg aggatgtgga gaaagccatt 70800 

aggtctgcat ggagcttccc agggaggatt tggaggcagc ctcacccgcc tctagcattc 70860 

ctgtctgctt aatcacacct cccttggctg cctcagtccc tgctctctca actccagggc 70920 

tcggcccttt ccctggtttg cctcttattc cttttaaagc agtggttttc aactagaagg 70980 

gattgcaaat ggcatttggc agtgtttaga gacagttttg attgttatgg ctgccagcat 71040 

ctagtaaagg ctaaacctac agtgcacagg accgcctcca cagtggagag acccaagtta 71100 

gctatgtgaa ggctgagaat ccctgctttg gagattaaaa aaggaagctg agggaaccac 71160 

tcagttggaa gcacccttgg tggcatgcac aaggccctgg ttctgtccct agctctgcac 71220 

aaaaaataga atacaaggaa gagtaaccct aatgagctgg tccctcaccc agtgtgccac 71280 

tgaggtcact tgaagggaag tctagcccca atttagtatt ttttgtggct gccatacctc 71340 

cagccttgat caaatctcat ggtatacatt ggtaagaaaa agggtttgaa acatagacct 71400 
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gatactcgga catggaaaca gtatgtttgg tcagagagag cgaaggacct gatagacgag 71460 

ggcaatatca gagagagggc atcagtcggg ttagacacga gcattccaca gtgagcagct 71520 

ctggataagc ttttataaat gctggttaag gttttgaatt tgcccaattt tgtcaggatc 71580 

ccagagtcta tcacaaacat acacagtttt ctcaaatctg ctttgcagta tgcccgtgaa 71640 

tgtctcttat ctatactttc agatggtaag accctgaggg cagaggaact cagacccttt 71700 

gtgccccctg taagaccctg ggggatgcag tggacccgac tttgtgttct ctgcacagaa 71760 

aggagtccac tttcgttgag actaaggaag ggaactgaca agcttccctt tctggcttca 71820 

ggttggcagt gcctgaagct ctgagtgcca tctggaaggc agaaaggatc ctgttggtgc 71880 

actgtggccc tgagagt'gag gaggtccggg agctccggga aatgaggtcc tgcttactgg 71940 

actcgtcatt cgtccctgtg gggcccttgg tgtagagcaa tcatcctcac cctcaagaag 72000 

gagctctggt gatgactgag atgttctgtt ggcttggagc tctcatcaga gaggacggga 72060 

ccttcccacc tgacctgagc ctagtgtctg gcacagagag cacttgaaaa cagattgaga 72120 

cactcacctg ccatgctggc tgctgcttgc aagagctaac tgccctctga tggaaacccc 72180 

atgcccagaa aagactaaat ccagtatcta aaggctgctt taaagggttg tcactgcagc 72240 

cgggcttggt ggcacacgcc tttaatccca gcactcggga ggcaggcgga tttctgagtt 72300 

caaggccagc ctggtctaca aagtgagttc taggacagcc agggctacag agaaaccctg 72360 

tcttgaaaaa ccaaaaaaat aaaaaataaa aataagtaaa aaataaataa ataaataaat 72420 

aaagggttgt cactgatctg caggcagctc atgctagcct aggcttttgg ctcgatttca 72480 

tctcactaaa cgatgaatct gtttccctgg aacattccta tggtttctag tagtaatgaa 72540 

gtgctgtgtt ccactccagt gagaacttca attcttagtc ttgtattata attgaaaaat 72600 

aatatatagc aagaaatcag tatgactgct tacctcaaga gacatacaat tccacttaca 72660 

atatcctgct tccttaaatt tttcattaag actggtgata tataatttgt gaatggagaa 72720 

ataaatacgt cttactgttg gcagtttctt cctgggatgg caactctgta ttggtttcct 72780 

accagtgtcc taattcttac tcagtggctt tcattgagtg ttcttggcac tcactgtcca 72840 

agcactgatg caaggcaacc ctgtagcatg acttcatagc acaggcctcc ttgttagcac 72900 

acctgaaagc agaccactct ggctgtttca cttgcagaca gaatcttact ctgtaagcca 72960 

gtctagcctc aaacaacatc ctcctgcctc agccttccaa gttctaggtt tataggaaaa 73020 

ggccaccttg cccagcttga gactgcttct tactgccatg tctcttcagg ctcacacatg 73080 

aagtccaggg cactccagga ggagccgtga gtctgtctgc agggcactcc agggggagcc 73140 

atgagtctgt ctgcagggca ctccaggagg agccgtgagt ctgtctgcag ggcactccag 73200 

gggaagccgt gagtctgtct gcagggcact ccagggggag ccatgagtct gtctgcaggg 73260 

cactccaggg ggagccatga gtctgtctgc aaggcattcc aagagcagcc atgggcgtca 73320 

ctcattggta gactgtgagg ctacatctcc agatgccccg agtgctgtgg ttgtgagcac 73380 

tgctgctcat ggtttccaac tgagacagag ggaaggactt tgcccctttc cctaaggatg 73440 

ggtagtaata gtccagacca caagggacag atagctatgg ggttttctga ctcatcctta 73500 

gtacattatt gctgatgacc agtttgtttg gatgagttag tgggaaagaa gacccaagtc 73560 

catacactct gctttttaga acttgctcat cctagccatg cccaaggagc agccgttgac 73620 

tgtcatggca ttacagtgag gaaataaaca gtcctgaagg tgcctggcag cagcttttca 73680 

agaagctggt gttaaaagac agtattcaaa catctgcgga ctgggaactg ggcagcattt 73740 

gagtctcctg ctgtctgtta atttaccctg acaaggaggt gacttgaaag gtttgttttg 73800 

tttggggtag agctttttca ggaaaaaagt ttagtcctac agacaactct atagttattc 73860 

tagtccaaac tcatgccttg tgttttattc ctaaaagccc tgtcacactt tgtaaaatag 73920 

gtgctcttcc tcaaaggata tatttaacgt tttatatatc aggccttatt ctgtgcatgg 73980 

aagctttttt tagatgcttt gtaagatggc tcagtggtta agagcatgta ctgctcttct 74040 

ggaagtcctg ggtttgattc tcagcagcta acaccagctg ttattccagt tcctgggatc 74100 

tgatgccctc ttctggccta tgtgagcact gcatgtgcgt agtgcacaga caaatgcagg 74160 

caaagcactc atacataaaa ctaaattcaa aaaactcttt cattgtctca tgtgacctag 74220 

cttgagaata cctgtgctta tattataatc tagtatgagc cagccacggt agcaacacac 74280 

ctattatctc agcactcaga agattgagac tagatggtca agagctagag tctgggttac 74340 

aaaacacctg tctcaaaagt aaaagggctg aaaaagtgtc tcagcagcta agagcacaca 74400 

ctgcttctcc agagggcctc atttcagttc ctaataccca caccgagtga ctcaaccacc 744 60 

tgtaactcca ggtccatgag atccaacacc tctggtcgtc tgcataagct cctacactca 74520 

attatacaga gagagagaga gagagagaga gnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 74580 

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 74640 

nnnnnnnnnn ntctagagtg tttcaggttt tttttgtttt tttttttttt gagacaaggt 74700 

ctctctatta tgctgcctgg aactttctat gtagaccagg ctggactcaa acttatagtg 74760 

atccactact tctgcctctc agtactggta ttgaaggcat gtgtcaccac accccactac 74820 

ttcaagatct tagatttcca aagaagccgt agcctagaaa aggttaataa gtactgattt 74880 

aaaacagaaa gaaatcaggt acacttagag ctgtagaatg tcagcatgtg acatttgtga 74940 

caagttgtca aaactttgct cttaattcta aagagagaag ctgtcaaaag acttgaactg 75000 

gggctgtagc caacttggtc gagcccttgc atgaagctgt gtgtttactc cccagcactg 75060 

tggggtttga attgatttga acccagtaga ttcgtatatt tgaatgttta cctcatgggg 75120 

aatgacatat tacaaggtgt ggccttgttg gaggaattgt caatttgggg gtgagctttg 75180 

aggtctctct gctcaagctc tgcccagggt agaaagggag cctcctcctg gctgtctaca 75240 
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gaggacatag tctcctggct gccttcagat caagatgtag aactcttggc tcctccagca 75300 

ccaagtctgc ctgcacaatg ccatgcttcc taccatgatg ataatgaact gaacctctga 75360 

aactgtaagc cagccccaat taaatgtttg tctttataag agttgccttg gtcatggtgt 75420 

ctcttcataa caataaaagc ctaactaaaa cacattcctg ctgggcagtg gtggtgcacg 75480 

cctttaatcc cagcacttgg gaggcagagg caggaggatt tctgagttcg aggccagcct 75540 

ggtctacaga gtgagttcca gaacagccag ggctacacag agaaaccctg tctcaaaaaa 75600 

aaaacaaaaa caaacaagca aacaaatgcc agcatttggg aggtagagtt aagaagattg 75660 

ggagtacaaa gtcgtctcag ctagtatgtt tgaggccagc atggaccaca tgagacgttc 75720 

tcaaaacgaa agaaacgaat gaatagataa acatttgagt gtccagtttt ttcctttctt 75780 

tcttgctttg tttttggcgg tgctgaggat taaacccagg accttgttca tactaggcaa 75840 

gcattctcca ctgaggaaca ccctggcgag tgcctagtct gtctgtctgc ctgcctgcct 75900 

gcctgcctgc ctgcttgtta tgtgtatgag tggtaacctg catgtctgtc tgtataccac 75960 

agacatgcct ggtatctgca gaggccagaa gaggatgttg gatcgcctgg aactgggatt 76020 

acaaatggtt gtaagctgcc atgtaggtat tcagaattga acctggtgct ctgaaagagc 76080 

agccagtgct cttgttgttg gttttattgg gggcaggagg tagttatttg gttggttggt 76140 

tggttggttg gttggttggt tttcttgaga cagggtttct ctatgtagcc ttggctgtcc 76200 

tggaacttgc tctgtaggct caaactcaga gatctgcctg cccctgcctc ccgagtgctg 76260 

ggataaagtc atgtgccacc aactccagac aagcagccag tactcttaac cactgagcca 76320 

tcattccagc ccttctttgt gttttgagat ggtcacaaag tacaactcag actgagctct 76380 

tgatcaccct ccctcagcct cctgactgct gggggttaca ggtgtgtcac tgtcctcaat 76440 

tctgagtgtc agatcttgaa aacccattct cgtgaccttg atccttaaaa caaaccctgg 7 6500 

gagaatgagt tctgataact atttctcact cctcttcaag aaaaggaaag ccagagaaag 76560 

gggggggggc aagccccaga aacattgata acttgcccaa agttacacag caaaattcag 76620 

acagcctgca catctcagtg gccatctgtg ccatatccac cctgcccttc tctgacctcc 76680 

ccacctccat ccctacagac cttgcagttg agatcagagt ccaagccgta tcgtaagatg 76740 

gccttaggat ctgacatcat ggggactctc acggtcctgt cctcgtccaa atgaaaatcc 76800 

tggagggtcg tctttctcga gtcaaacttg gttacccact gccctggaag gaaacagatg 76860 

gagcatcctg agccactgtc cccagaaagg ccacaggtcc acgctgtgcg tccactggcc 76920 

aaagcaacct gagctgtcag cagcaagaac acaggagccg ctgggtccca gcatgtgtgg 76980 

cacagaccat aggctccatg caccacgggt tctggctatc ctcctgtagt aaactcagaa 77040 

ataagtgggt gttctctctc tgacttggat caccacgctc cttctgttta aagtggcctt 77100 

taatatgctg gtgtgtggta cgtgcctgct ctcctgtccc ctggggactt ggagtaggaa 77160 

gccccagggc tttcctctaa gttaggatcc actcttgcta ctactccata agatggtcac 7722 0 

aaagcaacgt aaaatggaaa ttaatcaaac cattcctgcc acaagaataa aacagatctc 7728 0 

aggggaggcc tgtggaaggg tctcctgagg ccttaccact gtctagaagg aagttgacag 77340 

cagttcttga gcaggggtgc gactccagga gttgggggct gctctgagag caggacagca 77400 

tgtattgtag agtgtctggg agggagctgt gttatcctta ccgtgaagat gaggacacgg 77460 

gctcatgggg gcagagccag gattaaacct ggtctgattc aaaaagccag agatctgtgc 77520 

ccagccccac gcagccattt cactggtcaa ctaattcaga aacacttggt ctgatatgct 77580 

catatgctac aagcactgtg gccttcagat ctccctctgg cctggtacct gcattcaggt 77640 

tcaccaccat caccacacac acacacacac acacacacac acactcggcc agagacaagt 77700 

ggggaagccc tcacccttga agtaagccac gccaaggaga aggatgctga gggcactggg 77760 

catttccctc gtggaccggg caatcttccc tttcatctgg gcctgcaccc agttgttaat 77820 

ctcctgaagg tctactcgag ggttgcccgt gaggatccgg ggcctggtcc cataggactt 77880 

ctccagaggg gcaacaaagc tggatttgac tcgaagttct tgagaggaaa cagatcaaag 77940 

atgagagctg aatcagcacc ctcactttga aagcatgcca gaccccagct tcctgctcag 78000 

catcttcctt tgacttgctg gggcatctgc cggcttgccc agaccctggc tagggaacag 78060 

tggattccac cgtttgcatt ccccgtccca ggccctcctg ctgtctccca gagcccactt 78120 

cctctttctg ttcctctgtg gtctcactgg ctctttcctg cccaccagtg ccaggcctcg 78180 

cctgagcaca cacagcctat tgtttagaca tcatggaagc atacagacaa cccaggccaa 78240 

tgaagcaact tcacgccagg cataatgggg cgtgcctgcc cttcagaagc agaggcagct 78300 

ttatgagttg ggggaccagc tgagactcta tagactttga gaggtggggt gggggtgggg 78360 

ctactgactc ctctcaaaca caattctgga agcactcttg aggttcttct caggggcagt 78420 

aacagaggca aggagctcct tgtaggtgct gtggatgtca gggttggtga tcaggtcgta 78480 

gtagagagcc cggtgaatga cagactctgt tcgatgttca gctcctgcca gagagaaaag 78540 

gatgccaagc ttcataactg cccgtgaggc ccgcatcagg atagggacgt tagacatcaa 78600 

tccttttgtc ctctgagagc ccgaggaggc cgatattgca gatgttttag ctggacaaga 78660 

tcttcagggc gtggaaagaa ataatgaccg ccttgctagg aagagctcta agacagggca 78720 

aggttatcag agctacagag agaagagtgg gatgtggtcc tgaagttctc ccatcgtaac 78780 

ctaccctgtt ctgaggagga gccagctctg ctcacggcag ctgtacccct agaacctggt 78840 

taaatgacta aaacacgata ggaggccact taaggaacca aggtcgagtg ccacttacaa 78900 

agtggtaggg attgtgtgtg tggcccccac cgcccctttc ctgttcctct gacggcggca 78960 

gcatggaaac tctgagtggg ggaaattcag gtccacctgc agccttcttc agttgacact 79020 

cacccagaga aagggcagag agggccgtgg ccacgctgag tggagacagc aggac'gttgc 79080 
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ccgttgggct ggcactggat ctcaggcggt acagatcgta gccgaagttg gagacagctg 79140 

ctgccagctt gttcacaggg accttgaaga aggggtcctc ctcctccacg ggctcgcccg 79200 

tgctgtccgg gactggggag ccctgggtta gaatacaagg accagtaggg aggcacagtg 792 60 

agtacatcac ctcctggttg ggttggtcct ctagtccctg gggccatgag tctgaggtca 79320 

gaatgagtgt gtgctctctg actccacaac ctgtgtgctg ggaggtgggg agtgggaagg 79380 

gcaacacaaa agggcttgcc agacctgaac tgtggtctga gaacctgaag cctggcccac 79440 

tttaaaataa aacttgtagg gctggggaga tagcacagta gataaagtac cagcatgcaa 79500 

gttcaaggac ctgggttcag tccccagagc tgggcacggg ggtgcatgct tataatccca 79560 

acactgggga ggcagagatg ggcaggtcct ggggctcatt ggccaatcag cctgaactaa 79620 

tcagcgtatt ccatctcagt gaggggtcct gtttcagagg gcctgaggaa tgactctggg 79680 

ttgactacta gcctactctg tgtctgtttc tgtctgtctg tctgtctgtc tgtctgtctc 79740 

cacccctctc tgtccccttc cctctgcagg gaacttcctc accaccacca acccccaaag 79800 

aaacccaccc tcagaccagt cttccctatt cagcttgctg gctggtccta gtctgcctag 79860 

gttctgctgt gacgccctcc ctgtctttcc tgacaagcca tcccctctga ctagacccga 79920 

gaggaatttg tcgttttctg acctgttttc agtgtcagcc tcttccttat gagactttct 7 9980 

gcttttttgt tttgttccca gggctttgga tcaaagctgg gctcttacat acgttaggca 80040 

aatgcttggc cacccagctg tacctcccgt ccctgttgct tttcggtttg gaggactttt 80100 

tttttagttt tctgtttggt ttttggtttt gattttttgt tgtttgtttg tttgttttga 80160 

gacaggattt cgctatgtga ctctagctgt cctgggactc actatgtaga ccaggctggc 80220 

cttagattca gagatccacc tgcctctgcc tcctaagtgc tgggattttt agattttaat 80280 

ctgtacctac caacctcaaa ggaagtgtcc atggatagag ttcagtacta catcatgtgt 80340 

gtaacatgtg tgagggcctg ggcttcaccc ccaacagaga agggggagtt agtaggtgag 80400 

gaataagtga ctggctagtg gcaagacagt attgtctaag gtcactaagc cttaagccac 80460 

acttaaagcc cacaatccag gtctaatatg cccatctgcc ttgtccttgt gtgacatgac 80520 

cccaccccta cttcctccgt atagtggcag ctcctctgga tcctgaaagg agagggaaga 80580 

tattcttgtc tcgatgttaa agtaaccaag gcctagaaga gtgaaggcca aagcccaccc 80640 

tggatccagg gctgcctccc tgcactgtct cttctgctgt- cccacctacc caccctactg 80700 

acctcagagc tgctggggac gttctggctg ctgccgtgcc cgagcagggc tccagtccag 80760 

aggagtagca ccagggcctg catcccggaa ctacaagaga aacaagagag caaacgactc 80820 

ccctcaccca caccctcccc tgccactgca cattgcacac tgcacaggga caagagtcag 80880 

gcagagtgag cccttcccct ccctccagct ctcagcccca agtggaccct tgacttgagg 80940 

tcttccgtcc ctgacctgcc cctgcacttc tccttgagct gtgcccccat gttggttcct 81000 

atcaggagac ccaccttccc atctaagctc cagcacaggg aagacccagc agcaggctct 81060 

tcaggcccca agacaatgct ctagcacaaa cacacaccaa ggcttttccg tggaggcaca 81120 

cggccagctc ctttggtagg atttggaacc ctgtctcagg atgggagcag agcccaggtc 81180 

atagacttac agaacatctg gtctggtcct gctatccacc aatagttctc tgaccaaagc 81240 

ctatgttaaa gacacacaca cactttcttc taggtaggtt cttgtgtatg tagcccaggc 81300 

tagccttgaa gttgcagcta ccctacttca tttgcctcct gagtactaca atgccaggtg 81360 

tgagccatca tgcctgactt gactcccttc ttctatttca agagcaattc ttagttaaga 81420 

gggtatgaac cagggccaca ctgccnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 81480 

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 81540 

nnnnntctta tttttagctt gtttgttttt cttacttgag acctggggag gggaggtgta 81600 

tgtgtcttcc atttgcttct tctacctaat aaagtttctt ggtgatgttt gggggggggg 81660 

gagggggtag gactcaagaa gggattctcc cataagctgt tccgtttggg tagtactatg 81720 

taaggaagtt acaggtgggc agagctcggc tctgcctgac tggcgtgctc tgaggtaaag 81780 

gtgagatggt gcaagatttg ggccctcagg agttggctct gttggccctg taccttctgg 81840 

tctgtgggta aggatgacca gtaggtgaga gatgagggaa ccagaacaga aggtgaaagt 81900 

tagtggggcg gagccccaga ctagtcaggt gggggtaaac tagatgactt tctggaaccc 81960 

caaggggctc ggagactagt ggtgttggag aagacctcta atgtgttgta aggcctctga 82020 

actcagtagc cgaacttgat gccagaaagc cccaaactgc taaacccaag caggagcggg 82080 

acgccatccg ttccatggct tcacccgagg tggccccatg gctgcgccaa tcaatgagca 82140 

gccgagagat aggggcgtgg acaagccagg aaaagttaca gcacgctgga aagataatac 82200 

aggccaggaa gccccaggca cagcagggtg gaaaagctag atcccgattc tgccggaggg 82260 

gggccccttc gaggtcccgg gcaccgggtg ccaggatcag agaaactgac tgaaacctag 82320 

ctgacctgcc cagaccatgg catcctgggg actccttgtg gctggcgctt ccttcacggc 82380 

gtttcgggga ctgcactggg ggctgcagct gctgcccacc ccgaaatctg ttcgggaccg 82440 

ctggatgtgg cggaacattt tcgtttcgct gatacacagc ctactctctg gagtaggggc 82500 

gctggtcggg tgcggaactt ggggactgac aaagcactga ggggcggggg tggaaaagag 82560 

ggcctggaag actgaagttg gaaccttttg gaatggaact ggtttgggtt gtggatgggt 82620 

gggagtaccc agtgggagaa tggatctagg tctgggagaa attgacctta gctctttgtc 82680 

ttctccaggc tgtggcagtt tcctcaaatg gtcaccgacc caattaatga tcacccaccg 82740 

tgggcacggg tcctagtagc agtgtcagtg ggtgagtgta cagaaaaggc tgaatcggga 82800 

aaggccttgt tggaccggga attctaggtt cctcccccat ctttggaatg gagcagatgt 82860 

tgctggaggt ttgctgtgag gaattaagga cctgagaaaa gtgggacttg agatatctag 82920 
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gctgtgcatc agctctgagc gaggagcctc atagtcttct ccggtgcctt caggttattt 82980 

cgctgcagat ggagttgata tgctgtggaa ccagacattg gcccaggcct gggaccttct 83040 

ctgtcaccat ttggcggtaa gactctgaag ggagaggcca ggtagtaagg gagcatgtcc 83100 

aactcaaggg cccaacctct ctcttcagtg ttctgtcctc tgacttttcc acaaagcccc 83160 

ctgaaaacct atcctctcag acttggattg agttggaggg aggttttgac tggctagcca 83220 

ctcctgggca ctgcccaagg agtttggttc tccccacaaa cctccagctg atcataaaaa 83280 

aaaaaaaaaa aaagccagga atgaaagcta gggtatgcta tgcaaatagt gtggcttggg 83340 

gtaagagaac ctctggtcca gggctgctca tgccccctag ataagggtca gcagaaaggt 83400 

caggattgga ggcagtccta aaaaatgctt gggtaatata aagtgaataa ataaaaaata 83460 

aataaatact aatttttaaa aagctgatac ctggaaggat gaggcagaga gtagaaaaaa 83520 

catgcgtggg tgtccctagg ataaggagct gggacttgtt gggcacaggt catgcaaagc 83580 

ctgaaccttg aaccttgcct gcaggtagtg agctgcctca gcaccgctgt tgtgtctggc 83640 

cactatgtgg gcttctctat ggtatccctg cttctggagc tgaactccat ctgtttgcat 83700 

ctacggaagc tactgctgct ctcccataag gccccatcct tggccttcag agtaagcagt 83760 

• tgggccagcc tggccaccct ggtcctcttc cgccttctgc ctctgggatg gatgagtctg 83820 

tggttgtccc ggcagcacta ccagctgtct cttgctctgg ttctgctttg tgtggctggg 83880 

ctggtcaccg tgggcagcat aagcatctcc acagggatcc gaattctgac caaggatatc 83940 

ttgcagtctc agccctaccc gtttatcctc atgcacaagg aaaccaagac acgtgagcct 84000 

gttgccagga acacttccac tctcagtctg aaaggtgtgg aagttttctc ttctgtcagc 84060 

ccccagggag gtggggctgg gaagaggaga tggtagccca ctgcatagtc tactatgtag 84120 

caaggactag actgtatcat cagagagaga gagagagaga gagagagaga gagagagaga 84180 

gagagagaga gaacattgta tgagatctcc attacagtca ggaaatcagg agatctaaat 84240 

aactttaaaa gtcccacagt ctttacatat tcttaaaatt tcaatctctt taaaatatcc 84300 

atctctttta aaattcaaag tctttttaca attaaaagtc tcaactgtgg gctccactaa 84360 

aacagtttct tccttcaaga gggaaaatat cagggcacag tcacaatcaa aagcaaaagt 84420 

caatctccaa ccgtccaatg tctgggatac aactcacgat cttctgggct cctccaaggg 844 80 

cttgggtcac ttctccagcc aggccctttg tagcacacgc- gtcatcctct aggctccaga 84540 

tacctgtact ccactgctgc tgctgctctt ggtggtcatc tcatggtact ggcatctcca 84600 

aaacgctgca tgaccccttc agtcctgggc cttcaagaga' gaagactaga gcctggcaaa 84 660 

gtggcacatg ctgataatgc tagcacttgg gaatgacaag cagaaggatc agaagttcaa 84720 

ggccagcctg ggctacaaga gactctgttt caacaaacaa acaaacaaac aaaccaaaga 84780 

agagaaagaa aaaactggac atgacagccg gaacattatc tgacattcat aaggtcctga 84840 

gttcaatgcc aagttggcag tgcctagttt gataagggtc tagccactct ggtaatacca 84900 

tggctgactg aacaccttac ccagcaactt gctgatagac tctgccttcc agcaaaaggg 84960 

aggagcttcg ctgaggagag aacattgaac cctattgtat atgaataaat tgctgtgcaa 85020 

atgatttcat cagtctcttg tgaatgtgat tgctttgagt catttttctt ggctccagtg 85080 

ttatcctggt ctgcagtgtg gtgtggagtt gtggaagctt tgagttggga gggtttcctg 85140 

ttaaggtttc tctggctctt ttctttcctc ccggtttttg ttttgtttgc ctggtggggt 85200 

tctctggtgt gtgtgtgtgt gtgtgtgtgt gtgtgtgtgt gtgtgtgtgt tagaagttgg 85260 

cggggggtgg agggggctgg agagatggct cagcggttaa gagcgccaac tgctcttcca 85320 

aaggtcctga gttcaaatcc caacaaccac atggtggctc acaaccatcc gtaacaaaaa 85380 

aatctgatgc cctcttctgg agtgtctgaa aacagctaca gtgtgcttac atataataaa 85440 

taaataaata ttaaaaaaaa aagaagttgg catggatgat gtagtgaaga ctggcattag 85500 

atatctctgg atccccctgc ctctacctct tagacactgt gagtatggaa gtgtaccacc 85560 

gcaccaggcc aggctagaac attctctgat ctacaaatac ctagagtatt attcctctat 85620 

gatcagaaaa cagacccagg gggccacaga aatgtcttag taggtaaaaa cacttgcttt 85680 

caggcctgat aacctgcggt tttttgtttg ttctgggggg cgggagaggc tggctggctg 85740 

gctggcctgg aattcacaga gatccacctg cctctgcctc ctgagtgtca ggtaccagga ' 85800 

tcacaggtgt gtgccaccac acttggccta actgcctgag tttgagcatc agtactcaca 858 60 

tggtactgag gatagaatag actctcacca gctcttctga cttccacatg tgccctgcag 85920 

catgggctct ccttccccaa aggaaaaata aatgtaagaa ttaaaaaaaa aaaaaaaaag 85980 

caaacccagg tcttgtgtga tggctcagca tcaaagctac ctcccgccac agctgaccac 86040 

ctggtgataa cttatagcct tgttatgctc tcctttgacc tccacgggca tgctgtacac 86100 

gtatgtgtgc ccacacaaac acacaatcaa gaaataaatg cagccaggcg aggtggcaca 86160 

cccctttaat cccagcactt gggaggcaga ggcaggtgga attctgagtt cgaggccaac 86220 

ctggtctaca aagtgagttc caggacagcc agagctacac agagaaaccc tgtttcgaaa 86280 

taaccaaaaa aaaaccactt taaatattat ttttattttg ttttgtttat cctggaactt 86340 

ggtctgcaga ccaggctggc cttgaactca cagagatcca actgcttctg cttcccaagc 86400 

acattaaagg atgtcccacc actgcctggc taaagattta ttttttcttt ctttttgttt 86460 

tgttttgttt tgttttttct aaaaaatttt tttaaaaaga accatccctc ctagcactca 86520 

ggagactctg aagtcagggc cagccaggtc tactgagtga gctctagggc agccagggct 86580 

ccacaaagaa accctctctc aacaaacaaa caaaagagaa cagacccaac cagacctgag 86640 

gacacacact tgtaatctaa gcccttgaga ggctgagaag ttcaaggcta gccacaagtg 86700 

tgtggtgcat tcaagagcag cctgggtggg ctacagaaaa agaaagaggg agagagagaa 86760 
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tggttaatga agatgactct ggaaaagtga aactcaagag aaagcccctc agatttgctt 86820 

aagacgagtt gagggtggag aaccgccaaa gcggacgagc cagacagaga ctgccaacaa 86880 

agttcaatcg gttcaggtac attacttcca aaacgccatt gccacatcag gatgcttcaa 86940 

tcagccaaac caacgcagcg actattgact tctgcatttc agagacttcc gtctctgtcc 87000 

agggcaatgt cactttagct ttcctttgca gaaaggaaaa gtccctgcct ctgatgtggt 87060 

agatcctcac acaccttctg ccagatccag acactggtat gactcagcct cggggagctc 87120 

tatctacaga gataagggta caaggcgtgt gtgtttaaag tatgtgttta aaagtacaaa 87180 

gtgagagtcc ctggaaaggg ctccctgccc tcaccatcac cgaaagcaca aaccttaggg 87240 

taatatctga cattcctgga aatgtatgta tgtattcatt atgtagccct gactgtcctg 87300 

gaatggggta taaaccagga tggcttcaca tctcagagac ccatttgcct ctgcctccca 87360 

agaactaaga ttagaggcat gcactaccat acttggctca tgatttactt aactttattt 87420 

tatgttcacg aatgttagcc tgcatgtatg tgtgtgcacc atgtgcatgc ctggtgcccc 8748 0 

agaggccaga agaaggtgtt ggttggattt cctggagatg aagtcccaaa caactgtaag 87540 

cagtccaatg tgtgtgctgg agatgaaact tggttcatcc acaagagcag tatgtgctct 87600 

taactgtgga ggcatatctc cagcctcaga tttcccagtt aatgtttgct ttcgcaccca 87660 

ggcccatctg cgcatgcgct ggagacctcc tttaccgcct tgagcctcat tggccaattg 8772 0 

tggctgggag acttgcagat cccaagtggt acaagagaag aataaactgg tgtgctatga 877 80 

actcacctct tctctgtagc cattggctga gcatactttg cctcaaccta ccgcccttcc 87840 

ttcccctaat cctaaatctt tgccctctcc aaatgtgctc ctcccccgca gtaatccagt 87900 

ggtcgctggg gctctagaga gatggggggg gggggagcaa cgggtacagc ttaaggcagc 87960 

tgcagcagaa cttttttgct gtatattgag tcttaaaaat tcatataaac tttgtgttct 88020 

gtttctaaat ataaccccat ctgtttcaac acaaaatgca acaacaaaat gtttcaaatt 88080 

gctatttgga ataattaaaa aatttcaata cttgatttaa aaatgcttta actttttaaa 88140 

taaattttaa atgttattat ttttaaaaag ttacaagttt aaaaaaaaga aagatagaaa 88200 

tcacataatg aaattaacca tacgcaagtg aggctcggtg cactggtaca cagttacagt 88260 

agccatttgg agtggaggcc atggcgcttc acattgaatt ttatactttc tttatagaat 8832 0 

attttttatg cacctatcta ctactgataa caaaacaccc atgagagagt tagaattaga 88380 

catcaattag ctttgatcct ctgtcataac tcgtgtccac tccctgcctt agtcctacct 88440 

catccctgtc ctcttttcta catcttatac tgaatccaca cactcagttg tttacacaaa 88500 

cacatacatc actgtccann nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 88560 

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnag 88620 

tgcctggctc tgttttgctg ttgttgtttt tgtttgtgtg tgtgtttaga tttggtttgg 8868 0 

tttggtttga ttttgttttt ttttagagaa tcttactatg tagctcaggc tgtccttgaa 8874 0 

ctcacagaga tctcttgtct ctgccttcca agtgctgaga ttaaaggtat acaccacctt 88800 

acctggcccc tttcatctat ctatctatct atctatctat ctatctatct atctatcatc 88860 

tatctatcta tctaaaattt atctgtgtgt gtctgtgtgt acattcccca gagcctgtgt 88920 

ggtagtcaat aagtaaccct cagaagttgg ctttctctaa tccttggatc aaacttgaat 88980 

tgttaggctt ggtagcaagc atgtttaccc actgagccat ttatgacccc atggcccagc 8904 0 

atcttccatg ggttctgggg acacaaatgt gtactttgat gtttacagga caagcgctta 89100 

accaaccaag tcattttccc agccccatcc tgactcccat taagtgttct ttcccccaac 89160 

ccaggaccaa atctagagga gtgtccatgc ccaacaaaca ctctgccaag cctctcccct 89220 

tactgctctt ctcccttccc ttccttcatt tcttcgttcc ttcttttctt tctttttgaa 89280 

acaggtcttt tctctgcatc ccaagctagc cttgaacttg tgatgtagct caggctggct 89340 

ttgaactcac agctgtcctc ctacttcagc ttcccaaaca ctgggattat agacctatgc 89400 

taccacacct ggctcatttt tcaaataaat aaaaagaaaa tcaaaaagtt cctagaacag 894 60 

tcacaggatt cacaaaaact ttggaaggag actaaaaatg gatttttaaa aaatgcttga 89520 

agcacaaaga gttgttgaaa gaagagagaa gaggaaaagt tagcttagta ggtagaagtc 89580 

aatcaagcct cacaccctga gttcaattcc tgaccctatg gtagaaggag aagagcaatg 89640 

ccggaaacat tatcctctga cctccagacc cgctgtggca cgtgcatgca cacacacaag 89700 

caggagccct tggagggaag tcctagaaat gaatcttact gaagcaggtc tgccaggccc 89760 

tgtgctcagc cattttattt ttcctttgtg tacccgacac gcttccattc tcaaagttgt 89820 

gagtctgaga ggaagtactc actgtgtccc cagtgagctt ctgtcttacc ctgggtcact 89880 

tagatggggt cacttagtgg tagccttggt gtggagaaag agaacacagg tccgagtagc 89940 

cagtagacct gagtctttat atctgcaaag ggtgttgggg cataatcaaa tctccccccc 90000 

tccccggggt cctgatacca ggttgtatta agtgtatgtg catggtcctt ccaagtcttg 90060 

acacatcatc caactccaag tggctttctc atttttcctt gccagtagcc tcttggtgag 90120 

gaaatggctg aggaaaacag agttgcagaa agacagggcc atggcctggc tgcaggcttt 90180 

ctctgagtct gaagagggtc agcgactctg agaaatgaag ctatttctga gtgagagggg 90240 

ccaaagaagg aacacggcag agggagagcc cccgaggaga tggagacaga agccggagag 90300 

ggaccctgtg cgaggctgga ggggaggaag aggggggagg agtgagaccc actgtcatct 90360 

gttgggcaga gaggggctac attcatctgc agtatggtgt agaggggaca gagagtgatg 9042 0 

gtaacaggaa aaatttgggg ttgagggggg cagcctgtag ggctgggccc cagcagtgta 90480 

cagctaggta gagtacacag taactcccag aattctctgg ctccactaaa tccctgttcc 9054 0 

gctccgtgca gagtaaaacc cacacagggt ggatttcagt ctcctttgca cccccctcca 90600 
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ccccccctcc acccccagct ctggtcacag ccagtcagag ttgggggtgg ggcagatctt 90660 

gtaaaagagg ctggtgagga catcggaaag tctgtaccct ccactagcaa agtgccagac 90720 

gctccgtgac actttaaatg cctcagataa aacagtgaga gactctcctg gtggcaggca 90780 

agatgatggg tcagggacct cagcgcctct gaggctcaga caccaggata aagaataaaa 90840 

acaccacgga gacccctgtg accccctcgt gcagagggag aatgccaatg tggcccagct 90900 

agctgctgga gcgcaagcct caaggcctgt gctagttatg agtctactgc tgctgccttg 90960 

tcccaataaa cccctttccg cccaggatta gtggacacgc cttgctcaag ccgagtccct 91020 

gatctgccac cttatcacac acatacaaaa atcccttgag gatggttacc atcctgggac 91080 

aaagcctcat tctctgctta acccaagtga cacctatatg gcagatccct gtgtccttct 91140 

cctgatgata acaacccttg caatccaata gaggggaact cgggtttctg tcagcttcct 91200 

ttatgctgat agaaatgtac tctgcatgtg gggagcctgc cttgctcacc ctgagacccc 91260 

atggggctgg ctggggcttt gcacatcatt gggactcaga gatgttgact acatgaacgt 91320 

cccacacttg gttgcacaag gcagaatgac aggatgttat gcctggtgtg tgagtgtgtg 91380 

tgtgtctctg tgtgtgtaaa acgcctctct ctggagccct cctgtctgtc tgcctcttgt 91440 

tcaatggctg cacaattgtc ctttctcttt ccaaggacct ctgtatgggt gtgtccttca 91500 

ttcagtgcct ttcctctgtg ggtttgtcct gctagccccc tgtcactgag aaagtcttct 91560 

gtctgtcctt gggttgtctg gctagaacac agacatcatt gtcttttttt tttttttttt 91620 

ttttttttaa agatttattt atatgtaagt acactgtagc tgtcttcaga cactccagaa 91680 

gagggagtca gatctcgtta ggatggttgt gagccaccat gtggttgctg ggatttgaac 91740 

tccagacctt cggaagagca gtcgggtgct cttactcact gagccatctc accagccccg 91800 

acatcattgt cttgcccacg actgctctcc agaatggtgg gcaggaggat gtgacccccc 91860 

acccccaggc accgggacac aacatcttct acacgtgtag gtcttgtgca ctggctttgc 91920 

tttcttcttc caagcaggtc tcccaggaaa tggcacttac agagattgaa gagtttaata 91980 

catgtctcgc tgcctctctt ttcgggaacc ccccagaggg agcagcagaa accagggctg 92040 

gcaggggctc taagctgcct gggcaaagga gcagggggta gcatggagcc ttagccaatt 92100 

tggaaagcac tgtgacccaa gcacattttg cagcagtaat gtcaaattct gccgttcagg 92160 

catgccattg atgtgcacgc tgccacacag aaaccagtga* cacaaaggca cagccttctc 92220 

caccctcctg gtgcttagga actaacggct ctaatgagaa atgagagctg aaaggagaga 92280 

gacgggggcg ggccacagca gcgcaggctg gcactgcgtg ttggaggagg ctgacccact 92340 

tctcgtagag gtaaggggcc cactgaaatg tcacttaaat tagccaccac tcccaacact 92400 

agatctcctt tgtccccata cctcagcccc acgcttcttt ctttttttct tctttttctt 92460 

ctcctctggg gcagcctcaa gcccagcacc cactttttag agctgtaaac caccctggtc 92520 

ctagaagccc tcttacgtta ggggatgaca ggaggtagag atcaggaagg agggagggag S2b *° 

gggaggagga aaggaaaagg gaggggagag agggaaagag atcgagagag catgcattca 92640 

tcacaaagag ccctcttttc tggctttttg actgcactgt gagttattta gccaacaata 927 00 

gatgtttatg tattttttta gaacccgtat ttattaacag cctgaaagga gagagacgga 92760 

gatttatata ggaagtgcag tgagttaagg ggggcaatta agagagcaga aagagatacg 92820 

gaacacagac ttgtaaaggg ttttgtaaca tccaatcaaa ggtgcttcag gtattttcca 92880 

aggaagcaga aggtaaaaaa aaaaaaaaat tgtcccatta gaagctgaca ctggatggag 929 *0 

caatggccca ggcggaactc ctgcttgaaa gaaggtgaga agggagggac acagaccagg 930 °® 

atccgatgag ccagagtgtg gccatagctg ggtcatgagg cccagggttg gaaggacccc 93060 

actaaagtgt gcactggcct ttccttgaca aaggatgcac ctatagctag gcgtggtggc 93120 
aagtggttgt tattctagta cttaggaggc tgaggcagga ggatcaccat gagtgtatgc 

ccagcctgga ctgcatagca acacccagtt tcaaaataac aacaaaagga agtgggggtg 93240 

gggagggcaa catttggaat gtaaataaat aaaacatttt ttttaaaaaa agaaaggggc 93300 

tagtgagtta gttcagcggt taagagcgct gactgctctt ccgaaggttc tgagttcaaa 93360 

tcccaacaac cacatggtgg ctcacaacca tccataagga gatctacgcc ctcttctggt 93420 

gtgtttaaag tcagctacaa tgtacttaca tataataata aataaattct ggagtgaggg 93480 

ggccagagca agtagaggtc ctgagtttaa ttcccagcaa ccacatgatg gctcacaacc 93540 

atctgtacaa ttacagtgca ctcatataca taaaataaat aaataaatct ttaaaaaaag 93600 

aagaaagagg gtggggcagg ggagggaaga agaagaaagg taagaagcta aataaaaggc 93660 

acagagatga gcttcatgtg gaaacacagg cctgtagtcc tggcactcag gtggggttgg 93720 

ggggggctac agtgagagta tcatgagttc aaggtcaact tgggtgagac cttgtctcaa 93780 

aaaatacata ngcnaaaaaa aaaaaaaaaa acatagccag gcatgatggt atacatttat 93840 

agtcccagca cttagaggac tgaggcaggg cagaaagaaa aggaattcaa gatcaggctg 93900 

agctgtatgc agtcctgatc ctatcccctc cccccccccc ccagagacag acagacagac 93960 

agacagacag agagaaacac aaagaaaggg gccttcagat ggctcagcaa ttaaaggcgc 94020 

ttgctattca gaccccatga cctgagctca aagcctggga cccaaggtag aaggcaagag 9 ^0 

ccaactccac agagctgttc tatgatctct atatgaatgc tggggcatgt gcctacacta 94140 

tgttgtgcac acatgcacag attagaaaaa gaggaggaag aaaaacataa gattgtttca 94 ^° 

agaaaagaaa ggctggcttc ttccacgtca gtgtgagagg agggtctggc ccctttgtag 94260 
ccaggtcctt cccagtccag tgggggctga actgaggcag cggaggaggc aataacggag 

ctttcccaac gcagtgtcca gcaaactcaa ctctacagcc tgtcctgatc cacagagaag 94380 

ccttcctggc tccctcacca atgcgggggc attggctccc aggctcctgg gccccccccc 94440 



93 



WO 02/29059 



PCT/US01/31488 



acacctgtgg agtgctaggt gatttgctaa tgttgggcaa catttgccca cgtggggttc 94500 

ttggctcttt ggtaatagac atgcctagca ggagggcgga gcttggaggg gggagtcctg 94560 

gggttgcccg tggctccctg cagctggggt gtctggccag ctgaagaagg agccatggca 94620 

cgcaaatggg agagcatgga acagaggctg tggatgctaa gcaatatggg aggcagtcta 94680 

agcttggaag cagcaggtgt ctgggaacgg gcctgtggcc caggcagatt tccagtgagc 94740 

actccagttt tttggcacaa ggaacaagct ggctgagccc aagaggcaag tggtgataat 948 00 

gaaacccgca gttgaggaac agcgggtaag ggtgccatgg gagcccatgt gctcatgaag 94860 

aggctggggt gtgaagaaga gcccatgcag ggaagccaca catcccctcg agttccaggc 94920 

agaggcagag tccctgagtg gggctccctg ggtctcccct tacctaacca gtctcccggc 94980 

accccagcaa acaaaatccc atccataatt tgaggtttat agagacctca aaggctgagc 95040 

tactgtgtgc cactaaccat cagcctaacc ctcccccact gtcttctcta gctgcccctc 95100 
tttcttctga gactgtgata gtggcgggga cgggttggga gtgtgtgtga agccctctcc 

gactctccaa ccccagctga gccccttgtt ctgcagctca gtaacacagt aacacaggct 95220 

cagttctaca ctggttgaga acactcacgg ctctctcagc tccttagaga gcctgttttc 95280 

tcattttcct gtccccaaag cctagacaat ggctggtcca tttgtaagct tatctgagga 95340 

tgccaggggc caccccatgt ctccactagg ctggcaatgt tctctgtcac tgtagtacag 95400 

aagactgcct ggtgggaggt gagataagga aagggatggt ctcccctggg gttcccacac 95460 

agtgctgagc ggaaaatggc agaatgggct gggaggtaac tctgttgcta gagtacttgc 95520 

ctagcatgtg caaggaaggg cctgggttcc atccccagca ctacagaacc caggcgtggt 95580 

ggttcatgct ggtattctca acattcagga ggtacagtca ggaagagcag aagttcaagg 95640 

ccatcctcag ctacatagct agcttgagac cagcctgggc tatgtgagac tttgtctcca 95700 

acaaacaaca acaaagcagc agaaggccaa ctggcaagag gagtattacg taaagtaaat 95760 

ccatctcaaa aagcaagtag catgtatctt ctttcatttt tttttacatt ctataaaggc 95820 

ctatcaagtc atgtatatat gcatgtatgt ttgtatgata tgaaagtagg gggctggata 95880 

gatggctcag cagttgagag cacttggatg ctctttcaaa gaacctgggt tcaattccta 95940 

gcacccacat ggcagctcac aactgtctgt aattccagtc tcaggggatc tggcaccctc 96000 

acacagatat ccacgcacat aaacaccaat gcacataaaa- taaataattt ttaaaaaaag 96060 

aaattggaag taaaactctc taaggagaca aaagggactg aggggaagtg ggaggggcat 96120 

gaagggggag ggcataggtg tgtggtgtgt ttaacatgca gaatacactt ctataaaagc 96180 

tttggggttc attatgcaat gtatacatgt gtgggtgcaa gatgtaagct gtgcatatgt 96240 

gtgggggcca aaggtctcct cctcaatccc tctctgcctt attttcattt aaattataat 96300 

tattactatt agtgtgtggt gtgatgtgtg tgggtgtgtt aagccctcac ggcaatcaga 96360 

ggatgtctgt ggtctgagga tgctctctta ccatgttcgt gtgggttctg tggatggaac 96420 

tctggtagtc aggtttgcaa agctagtgtc tttatctgcc gagccacctt gctggccttc 96480 

aaccttattt tttgcattga acatggaact tcctgagttg cctggacagc aagtccccaa 96540 

gaccctcctg ttcctgcctc ccccntgtcn nnntcacang aggacacacn gcttantggg 96600 

tntccggatt gctgcncacc tccccgccnc ccnagcctcc tgcctccccg cccctcgccc 96660 

ccgctggncc ctcccccccc cccccccccc cccccccccc cttccccccc ccccnnnnnn 96720 

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 96780 

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnncagaca cacacactca aattaaatat 96840 

agctctaact gctgttaaat tcacactcct tcacatcccc acgctaggac tctaaggagg 96900 

caccagcaag gcccaggtcc agcttgactt agagcaaagc atcctccccc ctccacacaa 96960 

tggaaacgga cggaaagggg catggaagca gaaccagaca acagcagcct agccaagccc 97020 

aggactctgc tccttccccc catgcctgcc gtgcaactgg ggaggcaaag ccccagccgg 97080 

tgctttctga ccgcttagcg gaagacaagg ggagcctgtg attatgattt ctgctgattt 97140 

gcaatgaaac actaatgcag tgggcttttc attaagccag atttattcaa tctaaagatt 97200 

ttatttcctt tatgtagaaa gtgcatcttt atatgttgtt ggaggagcag agatgtgata 97260 

aaaagaaatt tctcttatga actaatagca ctgatacata gtggtagcta tgcctaggcc 97320 

tctctctctc tctctctctc tgtctcctgt gcatgtgtgt gtgtgtgtgt gtgtgtgtgt 97380 

atgaatgcac acaaagtagc ccccccccat attatttctt ctgtgggatc tccagactca 97440 

gcaaatggtg gtgactggga agtctggcca tgcaattctt gccttttctc ttgccagccc 97500 

aatccctttg cattcaaacc cgggctgctt gctgtggcca gccctttcac ctggagtcct 97560 

tcctcctcct tacctgtctt cccatccttt gcagacaatt atcctcaata actagccaat 97620 

tacccttaag gacaattata ctcttccatc agcaaacacg ggtgttcttt ccttgagtct 97680 

tttgatgaag tcgatattaa agagatgctt tatttacata aagtcaaata gctccctttt 97740 

agaagggttt gggttcgatg tcaaagtttt aaaatcttaa ctagaggatg ggtgtagagg 97800 

gcttttggct agggtagaaa agagatggag atacttattc tgatgttgct ttaaaaggta 97860 

ggatgcccag agaaggtgga aggatggggg agggagggtc cctcctcaag ctaatgaatc 97920 

taaaagcagg gatgagctgg gcgctaggag tggaaccagt cagaagtgtc tgcctttgac 97980 

tgaccacagc tcctgccctc ccctccccca gtctctctgt gaaccgccag cattaggagc 98040 

taatcgcttc agaaagccag attggaatgt gttgctcacc ctccactgct cagaaaacct 98100 

ttattccagg caaggactga cccaaaccga tcatggcatc tgccaatcag gaggccaaag 98160 

gtgccggcag ggcgggacct agctgtgcag aaacagctcc gttatggcgc gcagaaaaag 98220 

ctggggggaa aggctaccgt tttatctctt ggcagatggc ttctctcttt gatgctttgg 98280 
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gccttacctg 
gttctcgaag 
cccccgctgt 
tcggaaattt 
gtagagtggg 
gaaagaggtg 
ctggcttccc 
tattaggacc 
cctggcttct 
ccctaaaggt 
ggacgctcct 
tactttcttc 
ctgagtgaga 
aacaggaaag 
ttacctttcc 
gcttcaggct 
aacaagggga 
agccagacat 
atctgccatg 
atgctggcct 
gctccttgga 
tcttagctgc 
agaccctcca 
tccacaattg 
gcaaacccca 
tgagccctgc 
gatccatgct 
ctcatcctgg 
gattaaaatc 
acaaagccct 
ttctaagtgg 
cccccccccc 
ataattctcc 
gaagcacgct 
ctctcaagtc 
atttacactt 
ctgaggactc 
ccctggcctc 
ggtacaccgg 
aaaggcttcc 
aaggtaaatg 
ccttgttacc 
ctgctgagct 
gtctccccac 
ttcatttgca 
tataagaaaa 
ggccaaacct 
ggcccggcac 
gatgaatgaa 
tgtgtgaggg 
tcctggggcg 
tgtctgtaac 
acgtggtcca 
gcctctgaag 
aaaacaccca 
caaagtgttt 
ctcagagtga 
agattcacgg 
cctctctctc 
ccatgtagga 
tgaacgagtg 
aaggcaagag 
ggttctatct 
cccaaccaga 



ttactgcctg 
ttagaaggaa 
gtttctaaag 
ctttttaatc 
agttccgtgg 
ttgtgggcct 
actctcaaga 
agagggactg 
ccaggagccc 
atggggatga 
gtaccccaga 
tgttgttgtt 
aaaaagagga 
ggcaggcaac 
tgacccccag 
aagtaaacag 
ggaggcagtg 
ttcaggcacc 
cctgccatct 
ggaggaaatt 
aacccacgtg 
ctcaagcact 
gggagtcatt 
aggagacgcc 
gccactgagg 
ctcccagggg 
tttatgacat 
ggaccaggga 
attttttttt 
cctgtgctga 
agtgtgatgt 
aaacttactg 
ccatctccct 
gaagaggcca 
tacacggagc 
catgccgcag 
catcaagcac 
cttacagatc 
gggctgcagc 
taggaaatca 
gatgttattt 
atgcagggct 
gagggagggt 
ccccgccccc 
gcagcttaca 
agaggaatga 
ggccgagaga 
aacagggccc 
tggatcatct 
agatgtcaac 
tcagcttcag 
ccgcagaggg 
gacccctctt 
gtatgtcacc 
aacgaaagaa 
ttcagcatga 
gatgcggagg 
ctggcagaaa 
ttttcaaggt 
cctggggcct 
ataggaaccc 
caggaggcat 
gggaggcagg 
gagaattcaa 



cacttgactt 
aaaaaaaaag 
agctgttttc 
atagcaggag 
ccacagagag 
gtgcacacat 
tgaggtgtgt 
tgtgtgtgga 
acatgccaac 
gggaccaagt 
ctcagccgcc 
tttggaagtg 
gagggaggaa 
aagacaatga 
ggcttagcca 
ggaagagttg 
gccaggcagc 
tctccttccc 
gagagaggcc 
ggtctttagg 
acagagctcg 
gtaaggttta 
gcctaccctt 
agacctggca 
aacttgcata 
acaactgggc 
agaaaggcct 
aggcgagcat 
tctgctccat 
aattagaccc 
agtggcagga 
tctcttaacc 
cctctggtgg 
gactcgggat 
tgatttattt 
cttcctgcgt 
gctgccttgc 
atgacctcct 
ctctctatgg 
gacgctgcta 
aaatgttgcg 
gtggacgggt 
ggggacacac 
gccctcactc 
gggcttgttg 
aagagaaagc 
atccatgacg 
gccacaagag 
gtccttagag 
agaggttccc 
tcggaacagc 
aagggcgggg 
gggttgatct 
gctggctcag 
aaactttgtt 
caactcactg 
gaagtggatg 
atggctgaga 
ggtttttgga 
gtgcagatgg 
aggcactaaa 
ttgagggtaa 
agtcccaatg 
gatggcagag 



gacctaggca 
ccccaaacca 
ttcccaagct 
tcccaattag 
cagaggcaat 
gtgtgtcaac 
gcaccccagg 

ggggtgttgc 
aaacaggctg 
gctttgcaag 
acccagggcc 
ctgatgtcaa 
aagggggggg 
ccacaaggtc 
atatagctga 
gacatgggtc 
catgcccacc 
tgggtgccta 
actgggactt 
gacactgaag 
catgacaact 
ggagagcccc 
ctgaactcta 
ggggagcaag 
caagaaactg 
aacagatcct 
cagtctcagg- 
cttctgctcc 
gaactcatac 
cgaaaaatag 
gcgcaggatg 
tctcgagtcc 
gctggtggaa 
gccatgtaag 
acctcccgtg 
ggcacggcag 
caggatgaca 
gccccgtgag 
gggaggctga 
gtaattaagg 
tcatttaaag 
ggcaattagg 
cttccggtaa 
cccacccccc 
cccttaccca 
cagaaaagtt 
gtccgcgcgc 
cttcttgagt 
cacaagccag 
taaagaatga 
cggaccgtta 
tcacccagca 
gctcctgaaa 
ccagcttatc 
tcaagagcag 
cccgtgacaa 
acaacagctg 
gccttgggct 
aatgtccttc 
ccctgtgttc 
cacacaatgt 
aggagtgtaa 
ccagcaaata 
ggacaggctg 



aaaatagcag 
caacacaacc 
ctacagcgtg 
cgtgttgggt 
attcagcata 
gcacatgtgg 
ccttttgatt 
tcacagtgca 
cacactcttg 
acagcaggca 
agctctgatc 
tgcagaattc 
ggacgggacg 
cctgtaacta 
gacccagtct 
tccattctct 
gatgccatcc 
gaggtgctgt 
ggtagagagg 
gcagtttcct 
tgccggctct 
agatgtaagt 
acatggtcca 
cctttgtttc 
cctctgggcc 
tccaggtggc 
atttcacaca 
ccccaaacaa 
agttttcaga 
gttagctgac 
ggctgccagg 
ctgggtttct 
agcgtaatga 
tacacagcat 
aaagagacaa 
caccccctcc 
gcagcccact 
gtctgtcacc 
ggaaatgaat 
agcgaagcat 
agtgtcctgg 
ctggcacggg 
ttgctgctgg 
acttctttcc 
gaaaatcacg 
catattaggt 
atataaccct 
tgccacctgc 
gcctgattct 
tgcttctatt 
ccttagctct 
ttgccactcc 
aacagtgttg 
tccccggtgc 
agtgtggtgc 
ccagtacttg 
tatccaggtc 
gcatccctcc 
ctgtgggttg 
ctggtgctgc 
ggttgtatct 
ggactgtgta 
caatggactc 
tctgagtttc 



cgagatacag 
cggaagtgtg 
gtggctctaa 
aatctttcaa 
aagccctaga 
cttgtggagg 
ctcaaagctt 
gaaacccaaa 
ctagtacatc 
cagagttctg 
tggcttgacc 
agcagagtgg 
ggccgaggcc 
cactaactgc 
tggtgctgtg 
ctcctcatcc 
ttctgggagg 
gtctgcatcc 
ttctccacac 
ctgttcagtg 
caactcccat 
atggatggga 
gcttttccat 
tgacccattt 
tctcctggac 
tgcagtgaca 
ctgtatttcc 
gtgtgggaat 
taccgaggag 
aattacttgt 
gctgcagtct 
tgccgggatg 
atcaacgctt 
cgccagccac 
caatcatcat 
ctctccgctg 
ctcagcctct 
cgaaaaccac 
tccgtaggta 
aggtgcgtga 
tgcttcagtt 
tagagctcac 
gcagctctgg 
tgacagctct 
ttggaagaaa 
tcggatctgc 
gtggcaacag 
caggagacag 
ccaatattga 
tccatgctaa 
gctgttctcc 
ccccaccctc 
gctcaagttt 
tttcaagatc 
caactctgat 
gctgttgtgg 
caaacagagt 
tcccctcctg 
tgtgcctttt 
tgttgagatg 
gactagaagc 
aagagatgag 
tcctggtcga 
ctatggctgc 



98340 
98400 
98460 
98520 
98580 
98640 
98700 
98760 
98820 
98880 
98940 
99000 
99060 
99120 
99180 
99240 
99300 
99360 
99420 
99480 
99540 
99600 
99660 
99720 
99780 
99840 
99900 
99960 
100020 
100080 
100140 
100200 
100260 
100320 
100380 
100440 
100500 
100560 
100620 
100680 
100740 
100800 
100860 
100920 
100980 
101040 
101100 
101160 
101220 
101280 
101340 
101400 
101460 
101520 
101580 
101640 
101700 
101760 
101820 
101880 
101940 
102000 
102060 
102120 
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accgataaat 
gactagaaat 
tcacactgtg 
gggacacatt 
gagaaatggc 
ctagcagcca 
ctaatacata 
cacacacaca 
ggctcagcaa 
catggtggct 
acagatacaa 
gaaagaaaga 
aaagaaagaa 
acttccaggg 
gaagctgtag 
gtagacaacc 
caacgcctca 
taggacactc 
agacctatct 
catccaactc 
atctgctctg 
gaggactttg 
ctctaccctg 
ctgtgcccac 
agactgagaa 
aaatgtctta 
tttgtgtgtt 
taactaagga 
caggtgtgtg 
tgcatattag 
tttgagacaa 
accagaagaa 
atttgaactc 
agtccttgag 
gcttcagcct 
cttccttccc 
cttcccttcc 
cttcctttcc 
taatcctggc 
nnnnnnnnnn 
nnnnnnnnnn 
cgacaacacc 
ccaattttta 
atagctcctt 
gtcctagaac 
atgatagact 
aaagtcatcc 
ctcaaaaaga 
gggaggcaga 
ccaggacagc 
agtgaacccc 
tgtttaccca 
tgtttgaatg 
ggcagtattg 
gagtttaaag 
tggcctgctc 
ctgtttcctc 
tatactacac 
tctagagatg. 
cctccatctg 
atgagacctt 
tggtggtggc 
agaaaggctt 
aataatgatg 



ggtcataagc 
ctagctcaag 
acatgatgct 
catggaccgc 
ctggtagtta 
tgttgggcag 
catacataca 
cacacacaca 
ttaagagcac 
cacaaccatc 
tgtactcata 
aagaaagtgt 
agaaagaaag 
ctataggcaa 
acaacctgtg 
tgtggcctgg 
ccctctgtgc 
tacatcaaga 
ttacatgtga 
actatgtgac 
agagtcccct 
gatctgcaaa 
aaaccctttt 
aagaagtccc 
cattgtgtct 
ctgcctttat 
tgtttgtttt 
tgactttaaa 
ccaccgtacc 
gtaagcatgc 
tgtctttttt 
ggcattggat 
aggacctctg 
acaatgtctt 
cctaaatact 
tttcccttcc 
cttcccttcc 
ctctctctct 
tgtcttggaa 
nnnnnnnnnn 
nnnnnnnnnn 
ggcgccgctg 
aaaccagtgt 
actttgtttt 
tcaatgtgta 
gatggtgtaa 
ttggctatgt 
aataaatgta 
ggcaggcaga 
cagggctaca 
aacagtactg 
ggtgtaccca 
agagaaaagt 
gctggagagg 
cttcctccac 
ttggctccgg 
ccccacccct 
tggtgatctt 
tcctttcatg 
aactttagga 
ggagcctgca 
agtagtagaa 
tgggtgcaca 
acgatgaaaa 



agagtagagg 
gttgtgtgtg 
gtggtctgca 
aggttacaca 
agtatgcttg 
cttacaacta 
tacatacata 
cacacacact 
tgactgctct 
tgtaatggga 
tacattaaat 
aaacgaggaa 
aaggaaagga 
cctgtggcct 
gcctggggaa 
ggaagctgta 
ccaatttcct 
tgatcttgtc 
ggtcactcca 
agagtcatct 
gcctgcccgt 
gccacatctt 
ctctgaggtg 
ggggagaagg 
cacctaaaat 
tccttttcct 
cttctgagat 
cttctgatcc 
ccagctttat 
taccaacttg 
tttttaatta 
cctattagag 
gaagagcagt 
gctatatggc 
gggattacat 
cttttccctt 
cttcccctcc 
ctctccccct 
cttgctctgt 
nnnnnnnnnn 
nnnnnnaaga 
cctccactgc 
ttcaagccgg 
gctttgacgt 
aattaggctg 
aactccactt 
tgtgagtttg 
ttgccgggca 
ttttctgagt 
cagagaaacc 
ccggacagtc 
caaggtgtgt 
ttcccatcag 
tgatggggag 
gaagaccact 
cctcagttta 
ctccctgtct 
gactgtattt 
gctgggacct 
aacttcttgg 
ctttgttaag 
ctctgatgaa 
tggctataac 
taattctctg 



aaaaccacag 
gcagggttgg 
gatgtgttgg 
tgctatttaa 
ctgatcttcc 
actatgactt 
catacataca 
ttaaagaaaa 
tacagaggtg 
tctgatgccc 
aaataaataa 
aattcctaat 
attctgaggg 
ggggaagctg 
gctgtagaca 
gacaacctgt 
gttctctaag 
tcaagatgtt 
caggttctag 
agagatttgt 
gggaactccc 
ccaaaaccat 
gcttttagag 
gacccaaggg 
cggtggtcgc 
ccgctccatc 
gtagcccagg- 
tttcttccct 
ttgagactat 
gctatattcc 
tatgagtaca 
atggttgtga 
cagtgctttt 
acatattggc 
gtgagccatg 
ccctttccct 
cctcccttcc 
ctttcttttc 
agaccaggct 
nnnnnnnnnn 
agaagaagaa 
catccacctg 
tacactgaag 
tttgtgacat 
gccttgaact 
aggaggcaga 
aggaccaacc 
tggtggcaca 
tcgaagccag 
ctgtctccaa 
tggtgtcttt 
gagcagctct 
ctcgggtgtg 
gtgtagcctt 
gggctgatct 
tctaaaattt 
gtattccctg 
cagtttacct 
ggctcagaaa 
cttaaaggtg 
caccctgggt 
cagttagtta 
tcagtggcga 
tgactgttct 



acagaaattc 

ttcctgggtg 

gctgcatcca 

aaactccaag 

agaagacctg 

ctgagctcca 

tacatacata 

aaattctggg 

ctgagttcaa 

tcttctggtg 

gaaagaaaga 

taaaaaagaa 

agaatctgcc 

tagacaacct 

acctgtggcc 

ggcctgtggc 

gacacatgcc 

taacaaaatt 

acatattttt 

gtccaggaca 

cagtggtcct 

tttcctcttt 

aggcaggtct . 

ccagtgctga 

aaggaccaag 

ttactcctca 

ctggccttca 

ccacttccag 

gattcaggct 

cagcctttct 

ctgtatctgt 

gccaccatgt 

aaccgctgag 

ctcaaactca 

gtgtttggct 

tttccctttc 

cttcccttcc 

tttcagagag 

ggcttgnnnn 

nnnnnnnnnn 

gaagaagaag 

agacaggact 

tagtagtccc 

ggtgtgatgt 

tgcttctgcc 

ggtgggagga 

ttggctacat 

cgcctttaat 

cctggtctac 

aaacaaaaaa 

cctaagtctc 

atacccagag 

tgaatactcg 

ccggtggaga 

tgttaccaac 

acatgttacc 

ccctctagtg 

cttgttctct 

tttctaaggc 

tatttctgac 

ggtggtggtg 

ttcaaggccc 

gagacgtgct 

ccccacttcc 



atgccattga 
ttcaaccttt 
tagctaccct 
ggaagggcta 
agctctgttc 
aagctctctt 
cgtacacaca 
ttggagaggt 
ctctcaacca 
tgcgctgaag 
aagaaagaaa 
agaaagaaag 
ccttttccta 
gtggcctggg 
tggggaagct 
agcatcatgt 
atcaaatgca 
acatctgcaa 
gaggagccac 
gactggctgt 
taagggccct 
tggagagcta 
cagcagggca 
actatcgctg 
caggctctat 
tttttgtttg 
gctcactatg 
agtcctgggg 
ccatacttca 
ttctttcttc 
tttcagacac 
ggttgttggg 
ccatctcgcc 
gaatccttcc 
tctagccttt 
ctttccttcc 
cttcccttcc 
tttctctgtg 
nnnnnnnnnn 
nnnnnnnnnn 
acaacaacga 
caaatccaga 
acttgggatt 
agtcttggct 
tcctgctggg 
tcagaaattc 
gatatcctat 
cccagcactt 
agagtgagtt 
caaacaaaaa 
ctttcaactc 
gtgatacggt 
gtccccagtt 
tgggctttgg 
agaattggat 
tgatcaaaaa 
gtgctggctg 
ctctgctgac 
actgagcctt 
ttagtatgca 
gtggtggtgg 
atctagaaaa 
tcccatgaat 
ctctctctca 



102180 
102240 
102300 
102360 
102420 
102480 
102540 
102600 
102660 
102720 
102780 
102840 
102900 
102960 
103020 
103080 
103140 
103200 
103260 
103320 
103380 
103440 
103500 
103560 
103620 
103680 
103740 
103800 
103860 
103920 
103980 
104040 
104100 
104160 
104220 
104280 
104340 
104400 
104460 
104520 
104580 
104640 
104700 
104760 
104820 
104880 
104940 
105000 
105060 
105120 
105180 
105240 
105300 
105360 
105420 
105480 
105540 
105600 
105660 
105720 
105780 
105840 
105900 
105960 
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ccctagctct 
cggaacgtta 
tggctagtca 
tcagcaagaa 
gctaagtggt 
ctacactgtt 
aactggaatt 
gggatatttg 
gtagaggcca 
ctctcgatca 
ttggccttga 
agaggcctcg 
gcttgttgaa 
tctgccaagt 
cttggagctg 
taagtggcat 
atcagctgac 
ggattctgac 
tccctgcatc 
ctccccagca 
atctcctcag 
ttgatgaaga 
ccacccatgg 
gtaagaagga 
ctggggcgga 
tttgaagcaa 
ctatctcagg 
gctcttagaa 
ggagggagtg 
gccaagtcat 
cccctaggga 
ggcctgagtc 
ggtttctggt 
tctctaaaag 
ttcagcgact 
ctagacctcc 
ctcgagctag 
gaaaaaaaaa 
gtggctggct 
ttagctcagg 
tgtagccctt 
acagagcgcg 
gcactgcaaa 
gctggtctaa 
ttggttagtt 
ccaatcagag 
tgggtgtggt 
ctcgtgcgtc 
ggaagagaca 
agcaaccccc 
aaccgcggta 
ggggagggga 
ccgagtagtg 
ttggagaagt 
cgccctttgg 
gaggggtgcg 
cctggcaggg 
ttatgtaagg 
aaaggttagg 
ctggaaactc 
gtcacctttg 
ctgactgcct 
cctctctccc 
cgttcccacc 



tatctaccga 
gtcacttccc 
ctgaggaaaa 
agaagctgac 
ttcaagaagt 
ttgtagatga 
tgtttggcta 
gggacacttc 
gggtggtact 
aaacatcaac 
ggtgggtggg 
agtgaggagg 
acttttcttt 
cagtcccggg 
gggcaacaca 
ctgagaagtt 
ctaggaagga 
ccactaggct 
atccgcagga 
gtggcctccc 
aagtctgaat 
aaagatcatg 
cggggttgag 
ggatgtggtt 
aggcaggaag 
gcccagcagt 
cttccctctc 
atgactcaaa 
aagaggtggt 
tgggttcatt 
aggagccgcc 
agagaagcgt 
tgatactact 
tgagggcgaa 
aagacgagac 
agtgaagatt 
ctctccagtg 
aagtcctatt 
tccatcttta 
cctcctgtga 
taaggacttc 
ctgcattctg 
aatacacgcc 
gcaagtcgcc 
ggtaggcgtg 
cccagacgcc 
cacgttgccc 
acgacgccgc 
gagcggtcgg 
gcacccagca 
accgggtgat 
gagggggcgg 
tgtctggacg 
aactggggag 
tttgaggatg 
gaggtgcatt 
aaggggtcgc 
ggggagggga 
gtttcaccgc 
tacccgcacc 
cacctccccc 
gggaagccgg 
cacctcctgg 
tccctctggc 



atccctgcac 
ttcgcctact 
cggcagctgc 
agcaaatgtg 
ttcttcttaa 
agaaaacacc 
attcagtggt 
tggtcgtcat 
gaccttccta 
agtgctaagg 
aggaggttag 
gaacagggct 
cttccaagct 
acttcgcact 
tttacctggg 
aaaagtgggg 
agagagcaac 
tatcttccac 
gcccctggag 
cactaactga 
aagtaaggga 
agaagcatac 
ggggaagggc 
agggctgatt 
tctccttaaa 
caccttagtg 
atttctttgt 
agtcaagaac 
cccagtcttg 
tcggtttttg 
tcttcattgg 
ttcggagggc 
ttggtaaaga 
tcctttgtta 
attcactaga 
gggcctgtgc 
aagactgggg 
tgtggaaaaa 
gttctactca 
atcagttttg 
cccgcgaccg 
ggaagctgag 
tatacttcct 
taggccgagg 
gcttagagaa 
tgaatgggcg 
agcaatgggc 
cagctgatcg 
ccgtgcggac 
tcaggtgggt 
ggggaaagta 
cggggacagg 
tttaaagaga 
gaatatgaga 
cccgagctga 
gcctctaccg 
tgggcggtcc 
aaggagtagg 
gggttgggcg 
ccccctccca 
ctagtatgtc 
ctggctgggt 
ccaactcttg 
cctagtttca 



aagcacccat 
tcagtattgt 
ctgtgggcca 
ccaactgtgg 
ccccctacaa 
attccgaagc 
tctcaaccag 
aattgggatc 
ctatgaacag 
ttgagaaatc 
agagtggaat 
cctatgctag 
tggctagagc 
aagtttgtcc 
tcttcccagg 
tgagtgggta 
tgaagcagca 
agcctttctt 
cacccacatt 
cagtggtgac 
cagatgttgg 
attttacccg 
agaaaaaagg 
cgtcccctgg 
gacacaatat 
gagcccatac 
ctcagatgct- 
cccaggccac 
tccctttaaa 
cccatccccc 
tctccacctt 
gggactgaat 
tcttccccta 
agaacgtgcc 
aaagatttca 
gggtgacatt 
ccgtgcgagt 
aaaaaagact 
ctcctgttgc 
aggctaaaag 
agtcagagat 
tgtcaccgta 
tctgctcttt 
gttagccaca 
gctcctccag 
ggagtaagca 
ggtgattggc 
gagactggag 
aggtcgcagt 
gtgatctggg 
gggtcctgac 
ggcgctcttg 
gagtcccgga 
ggccagaggg 
ccatttagcc 
gcgctgactg 
ggcccctcct 
gggcggcggt 
gaggttgggg 
gcctaactgg 
cggtagagag 
ggggcgcctg 
gcccctcccc 
gtctccaact 



ggggtttaca 
gtttccagaa 
ccagcccatg 
gtctgctgga 
gaaaccacaa 
tcactgccag 
ggtggatttg 
atgtgctact 
ggcagccatg 
tcaggctgaa 
aaaatcagaa 
ggataatgga 
cctgctcaat 
tggagtgacc 
agtgggttaa 
aaggcaggag 
aagagctggt 
gtttaggctt 
cagccggccg 
aggaaatatc 
ggagaggcgt 
ctatggttgg 
agatggagaa 
ggtcgaacaa 
tctgaacgtt 
ttaatttaac 
ctcacttaga 
aagtttctct 
taagcaattc 
gcctttcaga 
tgaaatcact 
gggtgttaat 
atttttaaaa 
ccttgagaag 
ctaaacccac 
tgtccctata 
gacagtggtc 
tcgggtgttc 
ttcgcgtgct 
aagttccaag 
cagtttaaaa 
agaacttcat 
aaactgtagt 
ccttttcagc 
gcaagggggt 
gaggtgctgg 
cctgggtggt 
ccggtgtgtg 
gattttgctc 
gacccggtca 
ggccacaccc 
ggagaggagc 
caggagtcgt 
ccgggggcgt 
tagggaggat 
ggtcagggcc 
ctcgttccct 
gcggaggcct 
ggggcggaca 
ctgtcttgga 
gcccctagcc 
ggttagtcat 
acggcctccg 
catttggcct 



gaatctgggg 
gtacccattt 
ccaagtgagg 
tttctactgt 
attttattat 
tcagcactgg 
gactccccag 
ggcacctagg 
tacataaatt 
accctgtcat 
gggccaccac 
gaatagggca 
ttcccccaac 
ctgactccag 
aagtcaaaga 
gaccccatca 
ccaggagact 
gggctcagtt 
ccaggacagg 
tcccattcca 
cactcttggg 
ggttccatta 
ggacagacac 
tcagctgtta 
gaactcagga 
acagaagcgg 
aagcttagat 
ttgggggtgg 
agcagctttt 
ctctgattgg 
tccctaagta 
cttagaaccg 
agacgcttcc 
ccgtgggctc 
gagggataga 
ccccgaagac 
cctatacccc 
tgctgcatcg 
ccaccttcgc 
aaggaggggc 
atgccaactc 
tgaccggaat 
ttgacgtaaa 
cattggccag 
ggcctccttg 
cgcccccgag 
tcattcgcag 
ctgggcgctg 
ctctgtccac 
tcccgggggg 
tgcccttctg 
ctggactctc 
ggcagaaggt 
ctaaccccga 
ctggacgagc 
agttcaagtc 
cccggggatg 
tatgcaaccc 
ggaggagtgc 
cagagagaag 
cgggcttggc 
cgctgggctc 
gttaggctaa 
gtcaccctgg 
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ctgttagagt 
gggtcagtgc 
tcagcagcca 
gagactgatc 
ggaatttgaa 
aggcaaagtc 
gaaccaccct 
ccatgctgta 
cttggaactt 
cagatgtacc 
cagccaccct 
tcctgagagc 
aaccctcctt 
atttctgaag 
atgccgaaga 
aggggcacaa 
gaaggtactg 
acccagcttt 
ccctcacctg 
aaagctgcca 
gcagcagctt 
tggggtgatg 
gcccagtgaa 
ctccccaaga 
tgtttactga 
cacacacaca 
cacacacttt 
actccgttct 
tcgagacagg 
tggcctcgaa 
tgccaccacc 
tgggtttagg 
ccatagataa 
tcttcttctt 
caccccctga 
gtcccaggca 
ccaaggcttt 
tgccatttat 
gcaagaaatt 
cctgattcat 
tcctaacaga 
ggagttctag 
ctaatccagc 
cgggctgggg 
cgcttgcttt 
atggctctgc 
caacagaggc 
acattggtca 
gcctcagtga 
tgaagggcct 
aggagcatga 
gtaacactgt 
aacatgcacc 
ggtctgcaat 
ctgctataca 
caaaataaaa 
tccctgatca 
tggcagtacc 
gtatcctaaa 
tggaacgcag 
tagggcttgc 
gtcttacaga 
gtatgagttt 
accatccaat 



aggctagaag 
cctgggccca 
gtgaaccctg 
acctaggaga 
agttgcccac 
tccttggccc 
ctcgggctcc 
gggccccggc 
ggaacaaagt 
cggaatgtgc 
acccatacct 
ccctgctgac 
gctgctgaat 
tacatttttt 
tgctgggctc 
agcaagtgca 
aaagtcagct 
cctgtttttt 
gctgagccgc 
gttgaagaac 
tgcctatcat 
gctttatggg 
agatttaggt 
ttagttgatg 
gacttccttg 
cacacacaca 
cctttccatg 
aagtagtcat 
gtttctctgt 
ctcagaaatc 
gcccggctgt 
ggtacttacc 
gtaggcagga 
ctcgtggatg 
ctactcaccc 
agctcgtggg 
ccctccacct 
tttctgcctt 
gtcaggggag 
tctcacgtat 
agatgggttc 
cctgttcccc 
cccactgcag 
aggctgagtc 
ctttcctcct 
tgtcctcatc 
aggcacaaga 
gagaggcacg 
aagacacgtg 
ccaggaagta 
gccacctctg 
gggaaaacat 
aagccctggt 
cccggcactc 
atgagttctg 
taaagcattg 
taatacaaca 
aaccattgtg 
ataaggattt 
ctgtttgttt 
ctaagtaaat 
agaaaaattg 
gtgactgttt 
tgagtggaaa 



ctgtcatggt 
ccccgccccg 
tggcctgaac 
aggaagatcc 
cctagtgtaa 
ttctatctgg 
cagccctcta 
ttttactgct 
tcagacgtgg 
aagcggaatg 
ccctcccctc 
cttccaactc 
tccctaagaa 
tttttaactt 
cttagcaggt 
tttagaagcc 
agagccaggt 
tttttcctcc 
agtagttctt 
tgttgccctc 
ccggaaggtg 
agggaaaccc 
taaaggcact 
tctgtgtggc 
taggcctgcc 
cacacacaca 
aggtccaaaa 
ttgtgtattt 
atagccctgg 
tgcctgcctc 
atttacattt 
cttacacctg 
gggcattaaa 
gaaacgaaac 
agctcagggg 
aggctagtcc 
aaagcaaaac 
tgctttccag 
atgggttgta 
gggcttggtc 
agggggtaca 
atttgtgaga 
tctaagctga 
ggctcacaca 
cccttccctg 
caacatggag 
cagtggagga 
tagaagcctg 
cttccagcca 
tggtcccatc 
aaaggaaata 
atctatgact 
tctgtcttct 
tggaggtggt 
agccagcctg 
gttagtaatt 
agcaaatgaa 
agagatgcct 
ggttataaaa 
ggaagtccat 
tatattcatc 
agtggaaata 
ttaaaatatg 
tataaatact 



gccagagagt 
cagccaaggg 
agagctatcc 
gacaaagttt 
tctttccact 
cagtggccat 
gcctgccacg 
gattcatgcg 
aggggccggc 
cctggcatct 
ctgcctttgg 
tagtgcccct 
caagtcattt 
gggacttggg 
tgccaagagt 
tcttgcttct 
ttggatggcc 
ccttcctttt 
cagtggcaag 
tgcccctggc 
acagaactgg 
tggtcctctg 
gtctataaat 
agtgggaaga 
tttcttatct 
catacacaca 
gtaaatgtac 
actttttttg 
ctgtcctgga" 
tgcctcccaa 
ctttatttat 
tggatttttc 
agtccaccag 
agctcttcac 
gattaggatg 
tctactggct 
tgtagctctt 
gatagtgaga 
tgatatgagt 
tctgattgtg 
ggaggctgtt 
aactgaaagt 
gggataggat 
ttgcgacaaa 
gccacagtgt 
cctcagaggt 
cctggcctgg 
gagaacacca 
tctcctctca 
tctaccctgc 
cacagcaaat 
ggggttgtag 
gcattgcata 
ggcaaaggag 
ggctatatga 
caaagaaagc 
agccagaaga 
ttggacctgg 
tgttcatctc 
tccttttctg 
tgatggtggt 
taaatattga 
aacacatact 
gaatactaaa 



tgatggagca 
cacctgcttg 
tgggcagaga 
atacttccca 
ctctgaaaat 
gtccttggac 
cccccagccc 
ttggaactgt 
agacagcctg 
ctagtcctga 
tcagctgtcc 
cccatttcta 
gagttgatca 
ttctacaccc 
tgccagctcc 
tattcaagaa 
tctgggtcgc 
aggaacctgt 
ctttatgtcc 
ttcgtggagg 
ggtgggaagg 
gggagccctt 
tggggaatag 
aatagaagga 
tcatcatcac 
catacacaca 
tcaggaaggg 
tttatttgtt 
actcactttg 
gtgctgggat 
ttttagtctg 
cacctgtata 
tggtgactca 
atgaactgtt 
gaaggaaagg 
tctcaccatg 
ggttgggttc 
ctctgctcaa 
cccttctgct 
gttcaccttt 
tgttgtattt 
cataggggag 
gtgtaaggga 
gattgccctt 
gtccctccag 
gagaaagggc 
aaccacaagg 
ggaaagagag 
ggacctgcct 
agtttctata 
tcaaaaagag 
ctcagttggt 
aaactgaaca 
cctacattca 
gactgtctca 
agatgtggct 
aggctcctgt 
tagtttgctg 
agggttgtaa 
ctgtcatgaa 
gttctgtgca 
atactaaaaa 
tgtaatatat 
aagattatga 



gctggtcaga 
gcacaaactc 
gaagtggaca 
agaggctttt 
agaaatccca 
tgactgtgca 
cctccctgag 
gggggcgggg 
gaattcatac 
ggaagctgcc 
tccctcagac 
accctacaca 
cagagctcat 
tgccctttga 
tagtctgtaa 
cccctcatta 
tggccctgtc 
gcctcccaca 
tgacccagct 
aagaggagaa 
tctggacagc 
acccccactg 
gtgactccac 
aaagtctgtc 
catgccaaca 
cacacacaca 
ggacattgaa 
tgattgactt 
tagactaggc 
taaaggcgtg 
gcccagattt 
atggggaatc 
gagcctgggc 
gtccttcccc 
ctatggttaa 
catgggtggt 
tagcaaccac 
tactgtgcag 
gcctctagct 
ggcccagtct 
gacaggagga 
actagatcat 
ctgtagcaga 
ccctcgacct 
cactgggtac 
agcctggaag 
gcctatccgg 
cagccagcca 
tcctgggaga 
aacagcctca 
attcaaatgt 
aggtttgctt 
ggttggccca 
aggtaatcct 
aaaaataaaa 
gaaaccgttt 
gccttgtgtg 
tcttagaaat 
tagagaaaaa 
aatgtatagc 
gccatccaaa 
gattataaaa 
ttttttaaaa 
aaagtatgag 
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tttgtgactg 
aaagtggatt 
attttcccct 
ttaaataaaa 
tgcctgtgag 
gccgtctctt 
ctttgtgcac 
ctggagctgc 
ctagaagagc 
catgtttaga 
aagaaaatgt 
tgtgtgtgta 
ctccaccaca 
tacacacagg 
agaataagaa 
nnnnnnnnnn 
nnnnnnnnnn 
aacatagctt 
ttaattttta 
gttaatgcaa 
taaaagtcat 
atgcacacat 
taaataattt 
tgctgctttt 
ttagaaaagt 
cgtcagcaca 
caactcactt 
cacctgaagg 
aagccttagc 
tatttctcga 
caaaaaaatc 
ggttaggggc 
actctgacta 
ggttgtatag 
aaagatttat 
gatcccatta 
ttggaagagc 
taaaagttta 
tgtgtttgaa 
ttgattcttt 
cagctgagct 
agaaggaatt 
gtatgatatg 
cagatcacta 
taatcaaaat 
cctgtaacct 
aggacaacag 
gaacaaaacc 
actctgggac 
agcttccacc 
ctagcggaga 
gggtgcatga 
gaagccatgg 
gtaatgaatg 
gaaacttgga 
ttaaaagaaa 
aagcatcaac 
tcgggctggt 
ttcaaatccc 
ttctggagtg 
aagtattaaa 
ttttgttcct 
tgctaaatag 
tggcctaggc 



tttttaaaat 
caaaatgtta 
tgttttttct 
tacaaaagac 
tggcaggaat 
cactccctct 
cagagtgtgg 
acgctgtttg 
agcaaatgct 
ttttggagaa 
ttgcttggga 
tgtgtgtgtg 
ctaaagtcaa 
aagcattcct 
agaagggaaa 
nnnnnnnnnn 
nnnnnnnnnn 
aaagaattct 
atgggaaaaa 
acaggacaga 
ctatttatac 
acaaacagag 
taagctctaa 
gccattcttt 
aaggagcaca 
cagcaatgtc 
ggactggtac 
ctgcagcagg 
tgcagtggaa 
agtgaatgta 
caacttttgt 
acaagctc.cc 
ctacacctgg 
caaattttta 
ttttatgagt 
cagatggttg 
agtcagtgct 
aaattaattc 
gtatatatgg 
cctttcactg 
atcttgtggg 
catgtaattg 
gtaagaaaaa 
tgttcaagga 
gatcccatcc 
caagtccccg 
gatatctgag 
aacaacttca 
acagtgtcac 
tgggacacaa 
ggctgcaagg 
tgtgaaactt 
ccaacaaccc 
ctatgtctaa 
cagaaaagcc 
tttcacaaaa 
taagaggagt 
gagatggctc 
aacaaccaca 
tctgaaggca 
aaatatgcaa 
gttgtttgtt 
tggaagatga 
tactctgcct 



atgaatgcat 
agaatggttg 
gtctttctaa 
aaataaatgt 
ccaacattgt 
gtaattttta 
gtgcacattc 
gatcttctga 
cctagccact 
tttgcttgta 
tacctattgc 
tacactggag 
ttctatgaac 
ctgactttct 
tgctcaaaac 
nnnnnnnnnn 
nnnnnncccc 
agagaaaaaa 
gtaatatcaa 
ttatcttggc 
gagatgaaat 
tctttttatt 
aatttaaaaa 
ctgataaaag 
tcctatgagc 
ctggaccatt 

ggggtgaggc 

ccagttccgg 
cctatgcaaa 
caggcaccat 
gagacaggct 
aaggagtatc 
cttttaggtg 
ttcaccaagc 
acagtgtggc 
tgagccacca 
cttaaccgct 
tcattatgca 
ggatgtagtt 
tgggtcctcg 
ccaccaaact 
aaatagaaaa 
ccaagaaaaa 
tggtgtccca 
agacctgcta 
ggcagcacag 
aaggagtccc 
ccatgatgaa 
acactacagc 
tgtcacacac 
gtggagggca 
acaaagaacc 
cagcactctg 
agagatgata 
aaatgtaaca 
aggcacagaa 
caggaacagt 
agcggttaag 
tggtggctca 
gctacaagtg 
taattggaat 
ttccgagcct 
ccttgaactt 
gactcaggct 



acttgtaata 
tttttgtatg 
tttttaaata 
tttaaaattt 
cttctgaaag 
aaaaatatat 
tgcagaggcc 
tgtggatgct 
aaagccatct 
ggaggatggg 
ttcttgagtg 
ctaggaatca 
ttcattaatt 
gactgtcagc 
ctgtcactct 
nnnnnnnnnn 
tattttacca 
aaaaaagtcc 
agaaatccat 
caaattaaac 
gtgaattagc 
cttaagccta 
atagtgaaac 
tcccaaaggg 
tggtgacctg 
tgtgaggagc 
tggcgatggg- 
caggttttcc 
cacacaggaa' 
ggcacgcatc 
cccttcattg 
tgctctccta 
ggttctgagg 
cagacaaccg 
tgtcttcaga 
tgtggttgct 
gagccatctc 
cgtgccacag 
atgtgacggc 
gagcgtcaat 
tatttttaaa 
gcagagttga 
acctctacaa 
gaaaagggtc 
cagagaagct 
cagagctgtg 
agtgagcgtg 
catcatgaag 
ttccacctgg 
tacagcttcc 
ggtacaaggg 
aataagttag 
gaggcaggca 
aagtcccatg 
aatgcatggt 
tgtaaggtga 
gtattttaac 
agcaccgact 
caaccatcca 
tacttacata 
tattctttgc 
gtttcatgta 
ctgattcccc 
cagtgaattt 



tattttttaa 
gtgggatagt 
ttgtgcattg 
ttactctttt 
cagctagcgc 
ttgtatgtta 
agaagagggc 
cagaatcgca 
ctccctctag 
ctacaccaag 
tgtgtgtgca 
tatccagggg 
gtctgaatcc 
cagctaagga 
ntggggnnnn 
nnnnnnnnnn 
agcattgcag 
actaatgttc 
gtctacggca 
cacagtaaca 
gtgcgcgcgc 
ccaaataact 
cccttcaggc 
tgtcataatc 
agttttacac 
cccacgctgg 
caccactttg 
aaatggcttt 
aagggcagtc 
tggaggtcag 
tcttcccacc 
agggcactgc 
atttgaacgc 
ttttttaact 
cacaccagaa 
gggaattgaa 
tccagccctt 
ggaagtgtaa 
cagaggacaa 
agggtcagca 
aagctagaag 
agagggagga 
gtaaatctag 
ctaaaacaaa 
ggggggaagg 
cccaatgatc 
tagcagaagc 
atgtgtggac 
gacaaaaagt 
acagtcagat 
aacaggggtg 
gaaagagaga 
ggagggtctc 
agagttacaa 
aggtggggaa 
aagcgagcga 
tacctttata 
gctcttccaa 
taacaaaatc 
taataagtaa 
aattctacta 
atccaaactg 
tgtctccacc 
tcaaacactg 



aaaaacactg 
acaattgtga 
ctttcatatg 
atgagtgttt 
taacttctga 
tattatgtgt 
atcagattcc 
ctcaggtcct 
tcctcattgt 
tgccaggtga 
tgcttgtgtg 
ccttttcaag 
acctactctc 
ggtgtggctt 
nnnnnnnnnn 
nnnnnnnnnn 
gataaataag 
aatgttttaa 
tattaatgtg 
cgttttgaac 
aggtgcacac 
ctttaagcag 
tatgacagaa 
tgtatccttc 
ccaagtcaca 
tcctgagcaa 
gactgtgtcc 
gaggcaccat 
attcccgttt 
atgataatct 
acacttgcca 
catcagagac 
agatcatcat 
ttttttaaaa 
gaagacatca 
ctcaggacct 
atttaacttt 
cataagcaca 
ctctgtgaag 
ggcattttac 
ttggttaaag 
gaggaggtga 
tcgtgttcca 
gcccaggaag 
agggagaggc 
tccgtgatac 
cagaggcctc 
aaaagggtgc 
cacacactac 
tatttttctt 
agtgggatcg 
tgaagacata 
catgccaaag 
agtcagagga 
aggtggggat 
ctaccagata 
aaaaaaatac 
tggtcctgag 
tgatacggtc 
ataaataaat 
tgaatggagt 
gcctggaacc 
tcccaagtgc 
cttcaacctt 
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gacccctaac 
ctggggcttg 
ctgctgctgc 
gtgcaaaccc 
tggctcttag 
tttttgtctg 
gctttcagtt 
agtaaagtca 
tcatcaacag 
gaaaacctaa 
gtttacaaag 
tggtggaaaa 
gcagggagag 
aggaggggga 
tgtatttgcc 
aattgcctac 
tggctctgaa 
caaaggcagc 
acaggcagga 
cctgctgggc 
tttggcaaaa 
aatacattca 
acagagagaa 
agcccttcct 
cagtctatgt 
tgaaggaaag 
acctatcttc 
gcacctcacg 
tctccttcca 
cctgctgagc 
agcctcattc 
gactcaattt 
gacccgagtt 
gggagcagag 
gttcagtaag 
ccgctgcagt 
ttgaagctgg 
acagtaaaca 
tttaatccca 
tctacaaagt 
aaaaaaaaaa 
aaattttcat 
aagtctccaa 
agcagtcgct 
aagaggacca 
tgggatgcgt 
ggaacaagag 
catctttaac 
agttcatgta 
ctctaattat 
agtaaccttt 
gaattccaga 
agctgcccac 
aaaaccattc 
tgcaaatcaa 
tggtcccaag 
agctatcatt 
caaacccacc 
agcctgccat 
ctctgccagg 
ggtgaggccc 
caatatttct 
tatgaataat 
acagcacaga 



ttccatctct 
ctggctggtg 
tgctgctgct 
agtcaagaga 
aggagcagac 
cgcatagtaa 
ttaatgaggt 
ctgggctcaa 
gacggtgtgt 
aagcagttca 
ggttctaccc 
tatgagttga 
atgagaacag 
ccgagcacgc 
cttcaccctt 
cactatttct 
ggctgaagcc 
taaaaaaaaa 
agtctccctt 
ctcctgaggt 
ttactcttaa 
gcacaaagtt 
aaagtgagac 
ttacagctca 
gaagcatttt 
tagaaatgag 
cagcgaagtt 
aaggtgcttg 
ccagagtcct 
cctttcccca 
tcagaaccat 
ctataaaaag 
ttgttcccac 
gtaggcagat 
ggagcctgta 
ggcatctgct 
aggtcaaggc 
tatattttac 
gcactcggga 
gagttccagg 
aaaaaaaaaa 
tttttataaa 
cttgataatc 
aagtccgggt 
gaagttcaag 
gagacacacc 
ttaagtgcgt 
atctttggct 
cttcatcatc 
atagaacata 
atctcttgca 
gcattgtaaa 
aagacctcaa 
tacaaagcag 
gctatcccag 
ccatgaggga 
cttacagtct 
aacttcccaa 
ggtagtgcac 
agttccatgc 
tgtctcaact 
cagttcctac 
aagtatatta 
tacaggacac 



tgggtcccat 
gtgtcgctgc 
gctgctgccc 
tttcaacacc 
tgcaccacgc 
agactggttc 
tctccctgaa 
tctgtgatac 
tcgtggcagg 
attgtctctc 
tttcccccaa 
aagagtctca 
agttcagaag 
ttctaaagca 
tatacaaatg 
aaaataacga 
agcctcgttt 
aaaatgtccc 
gccactgagg 
cccaactcac 
gtagctcaag 
gagaagatta 
acctatttac 
gccttcccat 
taaaatcagc 
cacacagtga 
cttgccttct 
cagctatgtg 
agggatggca 
gcctctcctt 
ttctgctgcg 
ataactgagg 
atggggcgtg 
cccaggcctc 
accacagaag 
catagccaca 
cagcataggt 
tacaattaaa 
ggcagaggca 
acagccaggg 
aaaaaaccaa 
aattacagtt 
tgaggctggg 
agtcccttgc 
gtgctcctcc 
agtaacagca 
tactcccgta 
gagaagagaa 
atgcagttaa 
agatgtggta 
aatctggtac 
tagcagcaat 
atcaatcagt 
taacgtaggc 
gaagccagat 
actagattac 
attatggcaa 
gcaagcattt 
acctgtaatc 
cagcctggtc 
cccaaaccac 
tgacagaaaa 
tacagtacct 
acacacagct 



cttacccatt 
agaggctggg 
atatcctgga 
agcagtaaat 
tgcacaccct 
catttccctt 
gccttgccca 
taccactatt 
atccattaga 
aagtgctttg 
acaaagcaac 
ggtaccgtga 
cagccgtgtc 
gcaaataaga 
tttaccaagt 
gtttatgggg 
aggctgctca 
agagctcctg 
ctctcccttc 
cgggcatgct 
gaatgagacg 
aatagaattg 
ataggagagg 
tatcactcaa 
aagagaaagg 
cctgctgcat 
ttaaaaagaa 
gagggcagag 
caaaggcacc 
ttctacagtc 
catgtctgac 
agccggcttg 
cgtgctagag 
aatagccagg 
acactgtgcc 
gctacccagg 
accataggta 
aaaacaaggc 
ggcggatttc 
ctacacagag 
aaacccaaca 
ataattttta 
ggtgtagctc 
ctgtatccca 
tcttcaggta 
actactgcct 
actgtctgca 
aacctggtat 
tttgtaaaca 
attaggaaaa 
aagagacagt 
tgcccttcaa 
ctatagctaa 
tccgtttata 
cgtaattctt 
agcaggctcc 
gtgagctctg 
tcttaacaaa 
tcagcatgtg 
tagacagttg 
tgaaaacaag 
tggacacaat 
agtttaagtc 
gcaagcgtga 



cccagactcc 
gaagggactg 
gatcaaaaca 
accttaggaa 
gtgtctgcaa 
tcaactgttt 
tttctccttc 
tccatagcaa 
agagaaagca 
gcttcaaagg 
cttattttgc 
agtgctagtg 
aaggcaacag 
cgcgccaccc 
tgaagaatgt 
tttgatttta 
cctggagtgc 
aaacactaaa 
tccaactgta 
acatgcccat 
taattgtgtt 
attatgcttt 
ggcccaggct 
ctcagacacc 
tgagttgctc 
aagacacagt 
tgtggtattt 
accatcctga 
gggcttgaca 
tccaaattac 
tgaactggat 
gttggtagag 
aggtgaggac 
cagcccaacc 
gtgtaaccac 
aggctgagcc 
gacccccatc 
cgggtgtggt 
tgagttcgag 
aaaccctgtc 
catgtaccta 
gttgaacata 
atttagtaga 
gtgcttgggg 
atgaggagtc 
ggagtgctca 
caatgatgga 
tcctcagctc 
accaatcctg 
gctactaatc 
cccaaatcaa 
ttaacgcatt 
ggaaaaatct 
ataacctgtt 
aggctctgct 
gccctcggtg 
ggcagagaaa 
cacaaagaat 
ggaggcagag 
caagatcaac 
taagacgata 
taggctgggt 
tgagcaagat 
aggatttaaa 



cattctgtgg 
cttgctgctg 
ggccaattca 
aacccacctt 
agggctcact 
agaattgaca 
aagaacctgc 
caaatcgatg 
gggtggtaag 
aaaaggagac 
aactgacagc 
aactgtcaca 
ggagataaaa 
tgtcggagtg 
taacattgta 
cttctgtgat 
ttgtagatga 
actggtgcgc 
agttctaact 
ggaattcatt 
gctggagaca 
agttcatcct 
actgacaatg 
cagcccagtt 
agtagtacac 
ttaaaaggtg 
gctggtatgt 
gggaatcggg 
gatgccttta 
ctgaggtagg 
atggtcactc 
aaaacataaa 
ggactaggca 
tgagctctaa 
ggcaagcggc 
acaagactca 
tcaaagttga 
ggcgcatacc 
gccagcctgg 
tcgaaaaacc 
ttctaacata 
atacaatgat 
gtacctgcct 
aacaggcaga 
tgagccagcc 
ctctggacca 
gacagtacgt 
gttctggcta 
gcagcaataa 
cacttaatag 
atgatggcaa 
gtaacgcagc 
t'tctaaagcc 
ttgggccacc 
ggctacacac 
acctgctcat 
aattcacaaa 
aaataaatag 
gcaggcagat 
aacgctatat 
tggatcaatg 
ataattccaa 
attatcgcca 
ggcacccatc 
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cctacagtat 

catttttgct 

aattcttact 

gcgctggact 

ctttgctgtg 

acaatactcc 

aaaaaagcac 

ctcacagaaa 

aacaagtcca 

gacagacctc 

taaagacaca 

gctggggtac 

cattcccagt 

gtgtacccag 

ttgggttcaa 

aaccacagag 

gggagggggg 

gtaaacaagt 

aattttttta 

ccatgtgggg 

taaagaaacc 

taggagccca 

aattaaaaca 

gttaagagca 

agctaacgcc 

gcccgcaaac 

gattctttca 

ctctgcagac 

ggaattaaag 

aaagcgtttt 

acataattaa 

gattcactca 

atgaagactt 

tctttgtgtc 

cttcctacct 

tggactacaa 

tataagtgtg 

caatattagg 

cttgaagcaa 

acatacacac 

gcggtgagga 

gacaagccag 

atgccaggga 

agaagctgta 

gatatgggtc 

agcacacacc 

tggtccacca 

gtggctgcaa 

agaactggga 

acagaaggag 

aaactggtgc 

aaaaagacta 

gaagcaagcc 

tgttaaaaaa 

aggaggcaga 

agtgggttcc 

agagaggaag 

cagcgttatt 

aagatgaaaa 

gctcattctt 

gactagagtg 

aactggagtt 

aggcctgaag 

ttacaaacag 



atgcagcatt 

caatatggca 

agcctgggct 

agcttccaga 

gtgaaaggtg 

tggcttggta 

agagctcaca 

tcaagaagaa 

cttcagaggc 

actcagaaca 

gaagatgggc 

gctgtaacaa 

taacccagca 

gaggtaattc 

attaggatca 

ttggttaaat 

ggagggaagg 

ttcaggctca 

aagaggacag 

tacaggagca 

ttaaactgtg 

cccattgcac 

aacattaatg 

cagcgcccat 

ctcttttggc 

attcaaagaa 

tttttttgag 

caggctggcc 

gtatttaaca 

gttgtttttt 

aaatgccaat 

cacaacttat 

ctggatacat 

agacattagt 

attgcagtgg 

ggcagcaagt 

tgtgggcact 

catcaagagc 

aagaaactca 

acacacacac 

actgtaaaaa 

ggcctctgcc 

ccactctaac 

gagagttgat 

agccatgctc 

ggacctggca 

tagtggctgc 

agaggacaaa 

agtacccgct 

agactggtta 

ccatcactac 

cagagtacgc 

agactggaga 

aaaaaaaaga 

ggcaggtgag 

aggaaagcca 

tggatccaag 

aaatgacagc 

aataaacaca 

gttatccaaa 

agacactgct 

tgagggaggg 

cccgaaggtt 

tttaaacaca 



gactgtctag 

gcagtaataa 

gtgtttgctc 

ggatgtaaat 

ctgtttctga 

acaatgccaa 

tctgagccaa 

aaacaagcaa 

caaaaaccaa 

caagctatag 

tgctggcatg 

gcagaggaga 

atcacttctg 

actgcggctt 

agcacacttt 

aaacaacaga 

gggagaagct 

gtcggggtat 

aaaaaagaca 

gcatttgcat 

ttatctataa 

gccatagttt 

cttgttaaag 

atggaggctc 

cttcaagagc 

aaatacaaac 

acagggtttc 

tccaacccag 

cacacattat 

gttaagtttc 

gtgaaataaa 

atacttaaat 

tagaaacgta 

gtgagtgtct 

ttctcgctaa 

cccagcaatc 

agccttgtta 

tcttaactgc 

cagatggtca 

tcacacacca 

taaggtgtca 

ctcatcagct 

atggagctgg 

ttgatggatt 

tagtgagtgg 

agtcaataaa 

tcgtggttgg 

agtctctgga 

gggacagaag 

tcggaggaaa 

aacagcagta 

accctggcta 

aaaggaaaca 

aagccagaca 

gcaggaggat 

ggcaacaaag 

agcaagagag 

agaaaagatg 

agttctaaat 

caaactaaag 

ttaaaaaagc 

agtgggcccg 

gaactgagaa 

ataccagcaa 



ttttatttcg 
aatgtatatt 
actaactcca 
ctaactttgg 
agccacgaca 
aaaataccaa 
aaaaaccgac 
acaaacagca 
aaggcaaagg 
ccatgatact 
actgggaaga 
atgtctaact 
gcaaaaaaag 
ctggtggatc 
tatagtcaga 
acagccacac 
agaatcccag 
aaggtaagac 
gcgtagagaa 
ttggttactt 
agggcagcag 
tgattacaaa 
aaacagggct 
tcagccatct 
actgcatgca 
tgctataaaa 
tctgtgtagc 
agatctgcct 
acatatcttc 
aattaaaaaa 
taatatatac 
acaattttca 
ccctgaaaat 
aaacttgcat 
acctggaacc 
ctcctgtctc 
catggctgct 
taagccatct 
gagtttacac 
aggcttagtg 
ggaaagctct 
cttagtaact 
gatgggctcg 
ctaaagtagg 
ccccacaccc 
acaaaaacaa 
gagctgtccc 
actgatcaac 
cagactctga 
tctgaaacat 
attgaattgg 
tcatcaaccc 
aaaagttatt 

tgggggtgca 
ccatgagttc 
aaaccctgtc 
agagagcaga 
ggcccgacca 
atcattttaa 
caggggtggc 
aatagatgca 
ggaagcctgt 
catctcaaga 
taaattgtca 



cacactttga 

tgcattttga 

gtggacttct 

ttctcggcct 

gtcccatggt 

aaaaaccaaa 

actccctatt 

cataacaggc 

ggctcttaaa 

gctcttca'tc 

gaccagtgtg 

gtgtctgcca 

cccactctgc 

ttttttcctt 

gcacggagac 

cagaactgct 

cattcaggat 

tttatttcac 

ctagctatct 

ggcgaatagc 

tgggtgcaca 

gtcacgcagc 

gcagagatga 

gcttctccaa 

catggtacgc 

cccatatatg 

cctggctgtt 

gcctctgcct 

cttcttcttc 

ctacatagtt 

atatataaca 

caataatgaa 

cgcaaatgac 

aaaggctctc 

tggggctcct 

acctttcttg 

gggactagaa 

ttctaccctg 

acacacacac 

accactgtga 

gcacaaaatg 

atggttgcct 

agaggccctg 

gggaatcgat 

acccatgagt 

aaacaaacaa 

gcacttatgg 

tctagagtct 

aggtgatcag 

aactcgacgc 

gcacaacatt 

aggcgattct 

taacaaaact 

cgtttttaac 

gaggccagcc 

tcaaaaatca 

gagagtgggg 

atgacatccc 

taatggctgt 

cgggatttac 

cgctaaccat 

gcctataaac 

caaagcacag 

gctttatgac 



atcatccatc 
tgcatggtgg 
gacgtaagag 
cccctgaagc 
ggtttgagta 
accaaaacca 
ttttgaagaa 
taacaacaac 
catttacaag 
catcaattct 
ctcactcatt 
atcacaggca 
ccgcacactg 
cttgcagtgc 
aaaaggatct 

ggggggcggg 

gcagaggttg 
aaaaataaat 
tttatgtaaa 
cctggtagga 
atgcagtggg 
tctactcaaa 
tggctcagtg 
ttccagggga 
ttacctacat 
accatcttaa 
ccagaacttg 
ccacagtgct 
ttcttttttt 
ttataggcaa 
ttctgtaata 
aagctttgaa 
ggttttcatt 
ttctctatca 
gttccttggc 
gaaccaatgt 
ctctggtctt 
attagaattt 
acacacacac 
aaagggaagt 
gtgtcctctg 
acagcaaacc 
ttattaaagg 
ttcctttatc 
atgtgtggac 
ataaacgttc 
caggaagaca 
gcttgttatg 
gacagagatc 
atactggtcc 
cagaaaacag 
ggcactattg 
tcccagagca 
cctagcactc 
tggtctgtac 
ctgactgggg 
gtttggatgt 
agaaatggca 
gtgtctgctg 
ggccagccag 
taatcagcgt 
ccaacctgcg 
gcacaatctc 
agataggctg 
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acaggcatac 
cgacacagct 
cacagcatgg 
tggaaaaggg 
agggcccagc 
gtacaactat 
gaagcacaaa 
agttgagaca 
ctctttacaa 
gccagaggcg 
ggtccaggga 
ctacaggagc 
tcagggggct 
gagttcagtt 
taaacatttc 
aaaaaatctc 
agcaaagcaa 
aaaccccagg 
accgaaacag 
accattcact 
cgatgaacag 
gatagctgcc 
gatccaaggt 
ggaggagggg 
tgctggcctc 
ctcacccagt 
acaaagaaag 
gatgtaggct 
tgctatcacc 
tcctcagcta 
cacatctgta 
acttaaaaat 
nnnnnnnnnn 
nnnnnnnnnn 
tgtcatcttg 
attttattaa 
ctgtgccagg 
gtaacatggc 
gcttacttat 
agcgatggga 
ctaacctcag 
agcagagccc 
ggcgggaaga 
ttgtgtgtgt 
tgcatgtgtg 
catacaatgg 
gaacacattt 
caggaaatgg 
ctttcctggc 
caggcaccag 
aaataaaaac 
gtggatattt 
caaaaactgg 
ggtccaaagt 
tcttctggag 
aaaaaaaata 
agggaaagcc 
aggatcacag 
agaccccggt 
gccacaaaac 
tggtaatttt 
aactcactct 
tggattttga 
ccaaagaaaa 



cacaaagatc 
gagcctagtg 
ccagagtgac 
aagaaggcaa 
atccctctcc 
aaatactcct 
cataacactg 
tcaaacacac 
agtctccaca 
gcggcacaat 
gcactagtga 
ctagacaccc 
gatgagatgg 
cccagaaccc 
tatcctccaa 
tcgtggacag 
atccaaatat 
tcgctttagc 
aaaaggcatt 
cttaataaaa 
gaacacaaac 
agactgggct 
gctgcctccc 
tttgacacag 
aaaactcttc 
tttaatgcca 
atcacacaga 
ggtctcaaac 
catggtggca 
ccatgggtgt 
accccaccac 
aaataaaaac 
nnnnnnnnnn 
nnnnnnnnnn 
aagggttgaa 
aaaacaaaac 
gaacccactg 
tgcttacgca 
aatatcaaac 
aaaataacct 
ctgcagactt 
tgtgcctgtg 
gatggaaagt 
gtgtgtgtgt 
tgtgtgtgta 
ctatgcttat 
ctaaaatttg 
gtaggagagg 
accaactagg 
gcaggaaagc 
ggagcagaag 
tgtgagttcc 
gggctggtga 
tcaaatccca 
tgtctgaaga 
cctttaaaaa 
acactcgatg 
cgggacggag 
ctcaaaacaa 
accttcttct 
ttgtttgctt 
gtagaccagg 
tttgggggtg 
aacacccccc 



gggaagaaga 
acaggccgtg 
agggtcacag 
agagcaccac 
cacctgacag 
tcctagtgta 
taagttacca 
gtttgatcct 
tttataaacc 
ggtggctgag 
attcaagtaa 
tgtaaggtca 
ctcagtggct 
gtatcaggca 
gcacagacac 
acagcaaaaa 
gcgaaaagga 
atccaaaaaa 
tgattattaa 
atgttaccaa 
tggctgggca 
ggcatgagaa 
ccccccccct 
ggtctcactc 
tgcctcagtt 
tttaacagta 
caaacatcac 
ttgcacaaga 
tttcggaagc 
gaggccagcc 
tcaggaggct 
ataaataagg 
nnnnnnnnnn 
nnggaaaaca 
atcactgagg 
tccttctgga 
ccagcagaca 
gacaccacac 
ataagtacca 
gcacatggtc 
gtgggtactg 
ccagggaacg 
ggagctaggg 
gtgtgtgtat 
tgtatgtatg 
gtgtctttta 
gaatacttgt 
caggagagat 
tcactcacaa 
ggccccacat 
ctgggcctgg 
gggtctacat 
gatggctcag 
gcaaccacat 
cagcaacagt 
aacaaaacaa 
gcgcaagcgt 
gccagcctca 
aacaaaacaa 
tcccaatcct 
ttcaaggcag 
ctggcctcta 
tgtgggtgat 
ccccaaagta 



aacgggattc 
ataagctcaa 
caatacagca 
gagagaaaag 
agaaaaccag 
gacagagttt 
gaagtttaga 
aggggaatta 
aaatacactt 
gcagcaggat 
acattaaaca 
gtactactaa 
aagggcatgc 
gttcgcccac 
acataaacat 
atactgaacg 
taatacaatg 
ttcaatcatc 
taaatacagg 
acaggagaaa 
cagtggcaca 
ccaagggtaa 
gtgtcactta 
tgtggcacag 
ccagagtgag 
caaggttaaa 
acaggtgctg 
aaagaatact 
agaggcagga 
aaggcagcac 
gaagcaggaa 
ttggagagag 
nnnnnnnnnn 
cccattagac 
aatggctaat 
ggagtactgt 
cccaacttca 
acacttcctg 
tgtgagcgct 
agtaatgggc 
agaacggacg 
gaggacgctg 
cccgatgacg 
gtatgtgtat 
tgtgtgtatc 
ccacaagtaa 
tggggaagct 
ggcttagtgg 
ccaggatctg 
ccctgcaggt 
aagtccttta 
agtaaaaact 
tgggtaagag 
ggtggctcac 
gtacttacat 
aaactcccac 
gtgatcacag 
gctacacagg 
aaaaagcgtt 
tgaatgctat 
ggtttctctg 
actcacagag 
accacaggga 
tgaacatcaa 



ttgcacaaca 
aacatgcaca 
acagagacag 
ttaacccgcg 
gaggacgcag 
accaaagggt 
cacgttgaga 
aattattggt 
gtatcagaat 
caagtctgga 
ttaaataaga 
gataccaaaa 
actgctcttg 
ctgtaatgta 
aattaaaaat 
aaatctaatc 
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gaaagcattt 
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gctagcctga 
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cgctgcctga 
tttgagacta 
tttgccatca 
agatcagtag 
gtgagccagc 
gttcaagcca 
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tgctgaagag 
gacaaaactg 
gctcccttca 
aggataatca 
tatgctgggc 
acagggcctg 
aatccactcc 
tgactaagtc 
ctcaatgggc 
tgaatatggc 
gtggtgtgta 
caaaatggtt 
aaaattttaa 
atctgagccc 
ttaacacaca 
acagcctctc 
aaaacatttg 
cctccagcac 
tgtcgccaag 
cacccgactg 
aaccatccgc 
atattaataa 
taaaataatt 
tattctgtag 
ccggtctggg 
cacattatca 
cgaatttggt 
tgtagccctg 
atgtaattct 
cacttgggcc 
ctgtatataa 



ttttacaaat 
ctgcaaacac 
taagaaacat 
attccatatg 
ctgaaacact 
atcgtaatct 
atttaataga 
aacagaaaag 
tgttgtctga 
gaagcccctg 
aagacggatt 
ccaaactgac 
cagaggacct 
ccagagaccc 
aaatcttaaa 
aggtagactc 
cttatcccag 
gtattagcta 
gacaacattg 
ctcctcaacc 
ccagcaccca 
aacacttaat 
taaatttctg 
atggagtcaa 
catgagccac 
aaagaacaag 
ggtctctagt 
agatcaaggt 
tacaaggtca 
tgtggaagca 
cacaagagcc 
nnnnnnnnnn 
nnnnnnnnnn 
actgagaaat 
cttaaggaga 
gcattaacta 
agcttgctac 
atcacctcta 
gccaatgaac 
ccagacactt 
aattcacaga 
agagcttcag 
taatgccata 
tgtatgtgtg 
ataaaaatcc 
gtaatcttag 
cagcatgaat 
catggtggcc 
ctggcctcct 
tactgaaact 
cgggaagcag 
taaaacaaaa 
ctcttccaaa 
aacaagatcc 
ataaatcttt 
ggaaaagtcc 
gtaaggcaag 
ctacagtgtg 
tatactcaag 
tacttttttt 
gctatcctgg 
ttttcaaagc 
ccaaaccaaa 
aactaacagt 
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tcatataagc 
tctttcagta 
atgtaaacga 
ggtcagggaa 
cgtcctcacc 
tgcccaagcc 
actttaaaag 
aagaccttgt 
acaatggcaa 
agggaaggac 
actagtctat 
ctctgcatta 
tgttcatttg 
aaccgcatgg 
ggaactcggg 
cccaattcac 
caggctagct 
gtttccaccc 
ttcacaccaa 
atatgcactt 
tttactaggc 
accagctctg 
tctgtgcaat 
ggtaagtcct 
tggcataagg 
tccccaaggt 
acccactggt 
agcgaataaa 
taaagtaact 
taacggttcc 
acagtatcta 
taggtgattc 
ggccgtgcta 
gcttaaaatg 
ccaagtaaac 
ggcttcgcac 
cctcccttat 
aatctgcccg 
ttaccccgta 
gctgcccagc 
ggcctatctg 
acaccgtagg 
tgggaaattc 
tcttattgca 
ccaccgctgc 
aagcaacaca 
gtgtacatcc 
aaatcacacg 
ctttttccat 
ctctgtctct 
tctccggaac 
cctcccaagc 
tatagttatt 
gtgtgcagtg 
cctactgtgg 
tgagttatct 
tcatataaac 
cccctcacca 
tactcactga 
tctttaggga 
agccattcag 
agacaacctg 
ctcacaggct 
atgactaaga 



taagtagcct 
gatgccattt 
aaatatgtta 
gcacggctgc 
agcaccaaga 
tgaaaatctg 
tcaattccag 
gggtgattta 
aagctcctcc 
actatacaca 
tttttttttt 
aataaaccta 
ttttgactgt 
agcaggtctt 
tggtcagatt 
ctgtttactc 
ttgaactcac 
aagtgctggg 
agtggcttcc 
gcgtttctct 
tgaagcacac 
atgcaaaaac 
ggcattccct 
cactcagtgc 
ccagcattat 
acatagtgtt 
tctcttgaaa 
taacaaaagg 
tccaagggag 
cccttatgag 
atgaattaca 
tgaccaagcg 
gccctgctac 
tggtcgaact 
ttccgcagat 
ggcaagtttt 
caatatgctt 
catacttact 
acaagtgctg 
agggcaaagt 
atttcctcag 
cttctcacaa 
ccggtcacac 
gtcctgggtt 
cacagtgctg 
ggtctggaac 
tagctcccac 
tgctacacgc 
tttggagacg 
atagagtgcc 
tcactctgta 
gctaggatta 
atttgttaca 
tgcacatgtg 
cgtttgggga 
cgacactgac 
cacagcgtgc 
tgtgagtgag 
gttcccttgc 
tcagtaagtt 
tcagtctcta 
aagacggaag 
tgacaggcaa 
tcttctaaat 



gcaggtacat 
gagaaaaaaa 
aatcttaaaa 
aggcccccag 
acagccaact 
atatgaaaac 
ctatctgtgt 
aacagcgtgg 
tccgtggcct 
ggtaccaatc 
accttagcta 
gtgtataaga 
gtggtgtgca 
ctccttccac 
tgcttggcaa 
taatgtatat 
tgtggacact 
acaacagaca 
tatgaccttc 
ttttttactt 
ttcaggagac 
aaaaaacagc 
gacgcattac 
gcgcatccac 
cacagcgcgg 
ctgtccaagg 
atctgcaata 
aagacacccc 
aaatgacatt 
gaagtattca 
ggagccactc 
taggctatgc 
atgcaggtaa 
caatctgcta 
gtcggagcat 
taatgagcct 
gacatctttt 
gggcagccac 
gggttacaga 
cttccttatt 
cacaccccca 
tccacactac 
ttctcacagc 
gggcaggcct 
aaatagtccg 
aggcaactca 
gctcacgagc 
agaacccaaa 
gcggctcact 
tggctaccac 
gaccaggctg 
aaggcatgtg 
tatgtgtatc 
tgcaggtcag 
actcaggttc 
atgtaatgat 
atgtgcaggc 
ggagaggaac 
tggccaagca 
tttccccaag 
ctgccactcc 
gtgagccaat 
ttcttggact 
ggaaaaatgt 



ggttatggaa 
aaaaaacaaa 
aaccttaaag 
agcagcacat 
ataataagct 
agagggcaag 
ggaaggactt 
agggcagaaa 
ctacacgcat 
cccacgaact 
gtgtctttct 
aggcaaatga 
tgccaggcgc 
caccccaggg 
gggcttcact 
atttttgaga 
agctggcctg 
gcaaccatca 
tcctgacaac 
caagctttga 
accctgtagc 
gatggggatg 
ctcacaccag 
cacacccaaa 
acaaacacaa 
atggcctctg 
tttgcctaaa 
aaggacgagg 
gccgccacac 
cttctggtca 
aataccttta 
aaatgctctg 
tgctacgcca 
gaggtgtgag 
cagacagagc 
ccatggtaat 
agttttttga 
agctggccct 
catatgccgc 
tacaaagcag 
agggtctcac 
tcaccgataa 
ggtacaaccc 
ggtacataca 
cctgcaggcg 
aagctgctct 
gatatgcaga 
gaagtaacaa 
agctagcctt 
acctgggcac 
gcctcaaact 
ccatcagcgc 
agtgtgtatg 
aggacaactc 
caagatagca 
tcagtgtgca 
taggaaacaa 
tcaggctgtc 
ttatttataa 
acaagccaga 
tcaacctgtc 
aaaatcttac 
gaatgcgttt 
ctggttaagt 



acagctttat 
acattctttg 
cagacgccac 
tccctgaggg 
cataaaaaat 
acagataaaa 
ctcgggacgg 
aagaaaggaa 
ccaagcgata 
agaaaacaca 
tatgtttggc 
gtaaaggtaa 
attcgtggag 
gtcgcagggg 
cactgaggcc 
cagggtcttg 
gaggtctgag 
aggcaaaatg 
cccaaacagc 
ctccactttc 
tctggagtga 
gggcaaaggt 
atacatacag 
gcaagcattc 
agccaacctt 
cagactcctg 
aaacacaact 
gaagattatc 
acagcccccg 
ccccgtgttt 
tcataaagtg 
agcacactta 
gtcttacgcc 
tttagacctc 
tgtcccctcc 
agtgggttcc 
gacagcaaat 
gaactcctcc 
catgttcagt 
cagccaagcc 
cgatggcagt 
gatcatgcgg 
attctgctgg 
gttctctttg 
aaaggaagac 
tccttgcaag 
atcactaact 
accggcactc 
gaactcagag 
tttgttttcg 
cagaaatcca 
ctgacccaat 
acgtccatat 
tcaggagtca 
ggaaaagtgc 
cacagtggtt 
cctgtgggag 
aggctcggtg 
gatggtgatg 
aaaacctttt 
tgtggggcag 
tgacagtaac 
aagagaatgc 
aacccagcaa 



catacagact 
gattttcaac 
tgctctttgg 
aagccttgtg 
cctgaaaggc 
ggcaaactat 
actatctgct 
agtgcaggaa 
tggagggagg 
gcttttacta 
ataatttctg 
agtcagttaa 
gtcagaggac 
tcccagggat 
ccccaagggt 
ctatgtagtc 
tgaacctcct 
aagtcttccc 
aagtgcccgg 
ctaaaaggtg 
ccggaaacac 
ggcttgtgct 
aaaagacaaa 
aacagctcag 
ctctttgagt 
gaaacacgtc 
ctattatttc 
ttttcctact 
actctttggt 
cacttttagt 
ggctttgtgt 
ggggaggctg 
acaggaagaa 
gggagacggg 
tgatgcaaaa 
tctctctcct 
aacgtagccc 
tgcgtctcct 
gtaagtgacc 
ctgcagccca 
cagtccatga 
tatttgaaat 
tcaatcactt 
cggagaaaca 
cgccatcagc 
tggtgagcgc 
ctggtttcag 
acgcacctca 
tttggtctgc 
agacagggtt 
cctgcctctg 
tctttttatt 
gtatatgact 
gttctctcct 
ctttaacagc 
ttgcatgtag 
ttggtgatct 
gcagtgcctt 
tctaccctta 
aggcccaatg 
gacaagccac 
agccagccag 
agaattaccc 
ggagctaagt 
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cacgcaagcg 
aatcatgtca 
cctgcecccg 
tatacagacc 
aggcatttgc 
catttaaaga 
agtggaagag 
cgaatacata 
agctggttgc 
ccaggggatc 
tgcaggatgc 
gtggatttga 
tggaacatgc 
gaaatagctt 
tcaaagccca 
tccgttgtct 
gatgtttaat 
ggatcgagca 
aataatggag 
tccaagagaa 
cagtggttaa 
gtggtggctc 
ggctacagtg 
aggaagagga 
gccagtctgt 
gctacattat 
caggcctgag 
aactcaagcc 
cctggcccag 
cggcccctcc 
caaacctagg 
tgtgctgtga 
cacctagtcc 
ccctcctcct 
aactgggctc 
cccctccccc 
aagggacatt 
gcccaatcag 
ctcagtgaga 
taaaattgac 
gaaggtagca 
ctacacgatt 
ccaacagagt 
aatccgtctc 
gacggctcat 
ccactcttag 
gaacttgtag 
tgtgatcaca 
ggggttgtgc 
gtctctggtc 
agacccacac 
taaatattgt 
ctgtttttct 
ggtactacag 
ctcaatcctt 
cttaaaaagc 
tgggaatgac 
cagtgacaac 
attctacccc 
cagcgcgcag 
taattgcttt 
ggctttcagg 
ctcaatagca 
ctctggttta 



gtggatacct 
agtacctaca 
gacccccacc 
agctgttatc 
cactatgcct 
gacgaagcta 
cttgcttagc 
agaaggctta 
ttaatttccc 
tgatgtcctc 
ctcgggccac 
gtttggtccc 
atgctctacc 
ttaggaggat 
atttaaaagt 
gtctttagtt 
tctcaagatg 
cacagtgaaa 
cagtggcgag 
cgcaggttag 
gagcactgac 
acaaccatcc 
tacttacata 
ccactgagca 
ggcatggcag 
gtccctctac 
gtcccaggta 
cctgccgttc 
gttctcagat 
actcctgtgg 
ggagaaagaa 
atgcaccagc 
actcaagaaa 
ccacccgcaa 
acattgacag 
caccctcggc 
tttgtttcat 
ttatagaact 
atttgtttat 
tatacttctt 
aggacaggct 
ggtggtaact 
agcggtggcc 
accactacta 
caggagtgaa 
agccaagatc 
atcaatgaca 
gttactgaca 
tcagagtctg 
agacttctgt 
cactgaggat 
tattggttta 
tgagtctctt 
ctattcaatt 
ctacacaaca 
tagatatggt 
aatgtacagg 
acaacagaag 
cacatcagct 
ctgttaaagc 
ctctggctct 
ccaccatctc 
aaggcttggc 
ccagagcccg 



gctttgcctc 
aaatccagat 
cggggtctca 
aaattcagag 
ggctctcact 
gaaaccagtt 
atgcacaagc 
gaggaggggg 
agtccccaca 
ttctgacctc 
tatgctggct 
tgggacccac 
cgcaaattaa 
gcctaaggaa 
ttgagtgctg 
taagagagca 
tcctgtgata 
tgagacagtg 
cgatattttg 
accaaggtag 
tgctcttctg 
gtaataatat 
taataaataa 
cacattgaaa 
caggaattac 
actggccgtg 
aggacacagc 
tctgggtgtc 
ttagcctcat 
tcagagatgg 
acaaacgtac 
cccgagctgc 
tgcaatgccg 
ggtcagacaa 
cagccatgaa 
ctcaagcctt 
gagtctgcca 
caatcagaag 
ggaaatatca 
gaccatctat 
agtggcagag 
aaagctgccc 
caaggcaagc 
gccgctcaag 
cagagccagg 
taaagatcgc 
gtattttgtt 
gtgaaatatg 
ggagaaaaac 
cctacaatgc 
gccttaagtg 
catcagactc 
aaattctacg 
tccttaagct 
ccctcaggat 
gtggaggttc 
caacaaagag 
atgtgaccag 
gcccacagtg 
cttgccgtac 
ctatgttcat 
agaagacata 
aaaacagctt 
agcacagata 



tgaaccctgc 
ctagcctgaa 
ctgtgtagtc 
acccacttgc 
ggttgtacca 
tagacagctt 
tgttggtttt 
tggtggtgga 
tggtggctca 
cttggggaca 
caggggtgaa 
aaggaagagt 
aaacttaaaa 
cttctctgcg 
tcacgtgttc 
actgggtgag 
gcattataag 
aggaagaaaa 
ggaatacaat 
gaagaggatc 
aaggtcatga 
ctgatgccct 
ataaatcttt 
tggaagatgc 
agtctgttca 
actgaaaagc 
cctgacctgg 
tggagcgagg 
gcaaagtttt 
aaacaccatc 
tgctgacagg 
gccccctcgg 
ttgctccttc 
actgccaaag 
caaaggcagc 
caccccaatc 
aaaatgtttt 
gcattatctg 
aggttaaatt 
tggctaatct 
gacagtgagc 
ccacacagga 
agtcacatcc 
tttaacttca 
agacaggcag 
taatccttgc 
cgggggacag 
aacaccttaa 
caaccaacaa 
aacgctagct 
acagacaccc 
aaaacaaaaa 
acacagctgg 
tgggctaggt 
gaagtgctct 
taacaagctc 
tccattgttc 
ctccagagac 
caaactgaag 
ctggcgggat 
caggggatcg 
atgcctcgaa 
ggcagcagtc 
acccctgagc 



acaggttttg 
ttcaaagata 
ctggctgtcg 
ctcccaagtg 
gaggagcaaa 
tggggctgtg 
aaccctcagc 
gagatgggtt 
caacatccat 
tgcgactcat 
ggtgcttgct 
ttgacttcta 
tttaaagagg 
ttttcaggtg 
tgtatgccca 
aagtaactga 
ttgctgtgga 
cctatattgt 
agaaataaat 
actgggctgg 
gttcaaatcc 
cttctggagt 
aattaaaaaa 
aactgaaatg 
aaacccagca 
agcaacgtgg 
cactaggaaa 
ggtgcgaggt 
ccagttggtg 
taaggcttgt 
aacttcacat 
ctctcaccta 
catctcgtgt 
taccacagac 
aacagcaagc 
accaaaatga 
catgggagaa 
ctgtgttaga 
catttatatg 
gaaggaaaag 
ctgagatgaa 
ctgactccct 
gtcagaggac 
cacaggcatc 
cctctcttat 
ctgggtaggg 
acatgcctgt 
tgtagacaca 
accagaccaa 
atcagcacag 
cagatggatg 
tgaggagcgt 
aaaccacaca 
aaagtatttt 
tccaaggcag 
tcctcatctc 
tcactgtact 
tcagcagagt 
atgctctcag 
caaggtcagc 
tgggtttgtt 
gaagtatagg 
cacatgctga 
tgagttgcac 



gtttatgttc 
ggctacctac 
tagcgctctc 
ttgggattaa 
acaatgtggc 
ggtgtagctc 
gtgacagaac 
agaggttaag 
aactgcagtt 
ttggctcaca 
gccaagctgg 
tacactgagg 
aagctgtaga 
agattcagac 
gttctggctg 
gagtctagcc 
tgacagtgat 
atacgaggac 
gattcaaata 
agagatggct 
cagcaaccac 
gtctgaagac 
aaaaaaaaaa 
caataccaga 
cggacctgag 
taccctggag 
caggtgtaaa 
accttgtctc 
ttgccccctc 
ccttctgagt 
ccttctcaaa 
ggtctcagca 
ccttcaatgc 
cctttcccta 
cacactggca 
accagatgaa 
aatcttttaa 
gatttgcatt 
aaattattat 
aggccaggga 
aacacctcaa 
aaggagactc 
aaggtccata 
ggcaaccaaa 
agtaaaaggt 
ggggtatact 
ttgtgatttc 
caattaacct 
tgccatggct 
ttacacagcc 
ctctaagtag 
gaaatatgag 
tgcccaccct 
ctttgaccac 
agtttaatgc 
ctgagtgctt 
ttaccgagtc 
aagaaagtac 
cactgttctt 
acttagattc 
ggtttaagag 
aaccactcca 
cagtgtccag 
aaaacctacc 



132900 
132960 
133020 
133080 
133140 
133200 
133260 
133320 
133380 
133440 
133500 
133560 
133620 
133680 
133740 
133800 
133860 
133920 
133980 
134040 
134100 
134160 
134220 
134280 
134340 
134400 
134460 
134520 
134580 
134640 
134700 
134760 
134820 
134880 
134940 
135000 
135060 
135120 
135180 
135240 
135300 
135360 
135420 
135480 
135540 
135600 
135660 
135720 
135780 
135840 
135900 
135960 
136020 
136080 
136140 
136200 
136260 
136320 
136380 
136440 
136500 
136560 
136620 
136680 



104 



WO 02/29059 



PCT/US01/31488 



agccacgaag 
ggaccgagag 
gctgccgaga 
aagaagaatg 
acagacctcc 
cacacagcga 
ccctcaggac 
accttccctg 
ttcactgtga 
aagaacactt 
gcaagcacgc 
ctcaaccctg 
tgtgaaatca 
ctcattattg 
cttcagggcg 
ctcagtccat 
agttcagcct 
gaaagaactt 
agggagagca 
ttttgacagt 
gctgcagaca 
aaacaaaagc 
gcctcgagat 
atcccggaac 
cacacgtgtg 
aggaatagct 
gaagcccaca 
cctaagtgac 
gctttcaagc 
aacttaacta 
tgaatctgcc 
ggggagagag 
tgttttacca 
gaggtactgg 
agaattgagc 
ctggcccctg 
ccgcagacta 
ctcggcaggt 
agtctctgca 
tttaggagca 
cagaaggaaa 
nnnnnnnnnn 
nnnnnnnnnn 
atctctatat 
aatcccattt 
tctagaagag 
ggcagcttat 
ataagcgcag 
atggttgtga 
tgccttccac 
cttagtactt 
ttctatcatt 
gcgtgaagta 
cccctgtgtc 
tgtgtgcaca 
ccttgaaata 
gatgagatgt 
cccctcttct 
gcagtgctgt 
aggcaaagtg 
tttgtttgtt 
tagaattcta 
gcaagattct 
caggtattct 



cttataggcc 
gctccgtcca 
gccgtcaaac 
tacaggatgg 
cactcgggaa 
aggtcttcaa 
tcggctcagt 
acatgtccat 
ttttaatcga 
tgtaagagct 
tagtcacgca 
gacgcaccta 
aactgaactg 
aaggtcatct 
ccctttgaga 
caaagcccgc 
gtcagctctg 
gtccacttgc 
cattagaact 
tagacaccat 
ttgcagaatc 
agctcagcca 
gatctggaag 
cctcgggatg 
catgcataca 
taatattccg 
ggaaggcgtg 
ggcttacttc 
tgactgcact 
aatggctaaa 
atcacagtta 
ccccaaagct 
gcatgtatgt 
atcactgaca 
ccaagtcctc 
cattttgctt 
gttaaagctg 
atcaccccag 
gacggagact 
ctttatctca 
gccacaatga 
nnnnnnnnnn 
nnnnnnnnnn 
atgaatgagt 
acaggtggtg 
caatcagtgc 
ttttaattgt 
gcactcatag 
gccatctgat 
ctagaattac 
tcaacatagt 
ctctacaccc 
tgagttagag 
tctgagacag 
gcgctctcaa 
acagatttct 
ttttgaagga 
acagagaaga 
actctaaggt 
cagactcata 
tgtttgtttc 
ttgttgatta 
tcccatattc 
cactgagctg 



tctgggatgt 
ccgaagtcag 
ttgtcagcct 
ttttatggag 
ggagcattgc 
atacttactg 
gcaaggactc 
caaatagata 
atcttcatag 
cccatcaagg 
tcaccccaaa 
ctagtgcgtc 
tgggtaagtg 
catagtcatt 
agtaatacac 
cccacaatgc 
actcaacacc 
tcattgaaag 
gcgctgctag 
cctcggagca 
ccacatatac 
cccgagccta 
acaaaggtgt 
gagaagacaa 
cacgaaaata 
ataatgaaac 
ctgaactcca 
acagtcagga 
gttacttttc 
accttaccca 
agatatctgc 
ttgcccttcc 
gtgagcatca 
cgggagtcac 
tcaaagaaca 
taagtaaagc 
agtagctgct 
ggcttcaact 
atttacagtc 
tgtctgaccc 
ccgccatgtt 
nnnnnnnnnn 
nnnnnnnnnn 
atcctgtagt 
gtgagccatc 
tcttaacccc 
tttaaatcac 
aggtcagaga 
atgagcacca 
agtatatgca 
gtaaatcgga 
aggcttcatt 
tcagacaatc 
gggaggattc 
ccatttgttg 
ccattttgtt 
actttctagt 
aggaaacagg 
aacgtgcgtc 
accagtccat 
ttcttcttta 
tattcacaaa 
cttcagaccc 
tacagcattt 



caggattcac 
agactcgggc 
gcaaatcaaa 
aagagtcagt 
actacaagaa 
gatagcacaa 
acatcttctc 
tttctcttag 
ctcttgcaga 
ctcctcttta 
cactggagac 
tttagccttg 
atggccatct 
tttaacagcg 
ctgcaaaaga 
agccttccaa 
gtcttcttac 
cagtagctct 
gccctcgctc 
acagtccaag 
agttacattt 
cagttgtaag 
ttcctgtgcc 
tcgccccacc 
attcttacat 
ctaatattct 
atgattcagc 
gctggtgcca 
tgtcaggcaa 
gacaagctgc 
tgtgctacac 
cggtatcatg 
ctgagtgtct 
agatggttat 
gcgagtgctc 
ctagtgagta 
gctccttctg 
gtgcccctag 
caccaatctg 
tgtgccccag 
cctcggagct 
nnnnnnnnnn 
nntttttatt 
tatcttcaga 
atgtaattgc 
tgagccccct 
gtatatgttt 
tgtaagatgc 
agtggacttg 
attagctgct 
gagcactcct 
gggacgtgtg 
agtgctctct 
tcctgttgga 
ccatttcaaa 
tgttcccttc 
agttactcgg 
aaattccaag 
caaccagcag 
tgttttgctt 
ttagtcggat 
caaaagactg 
atagcagcag 
gtctgaagaa 



aatgacagtg 
tcctttgatg 
cgcacgcaca 
caatgtctgt 
gctgcaataa 
ctccctgagc 
cccacagagt : 
caacttctct 
ttccaatgat 
aaacacggca 
aaaaatcacc 
ctctctaggt 
tcacagggaa 
gagaactgtt 
ggccaagtca 
actgtgctca 
cttgttcact 
gatttcaccc 
tcaacagcga 
tgtaaagaga 
tcaagctatc 
ttaacacata 
aacctggcca 
aagttattct 
cttatcaaaa 
gggtcccgag 
actcttcctc 
aaaatgttcc 
tatgttacag 
aggaaattag 
cgtgacaaga 
gcttatttta 
gatgcctgag 
gtgctactct 
ttaaatgctg 
taaatgatca 
ctggaggcaa 
caatgtaatt 
tattgctaca 
atagcatcaa 
gagccccacc 
nnnnnnnnnn 
aattatgtat 
cacaccagaa 
tgggaattaa 
cccttttttt 
ctgtatgtag 
ctttggagct 
ggtcctctgg 
cagccacccc 
tggtgcaaag 
atgccagcga 
gctctgagtg 
tcacatagtg 
tacagtgact 
tttcgtcaca 
tggaaaagga 
aaaaggagta 
ttgaacaagg 
tggttagttg 
tggatttggg 
tttatatgtc 
caagcttagt 
ctgatgaaca 



ctggatgaga 
gccatcacgg 
catcactgac 
ggactctaag 
ccgatcatct 
taaagcccca 
tgtggtcacc 
gttgttcgac 
gtctgaaaca 
gggacgaaag 
actctttgcc 
caccgatccc 
ggacagaagt 
tgttagcgat 
gggcaagcta 
tggctctcac 
tcaataaggg 
tgcagaagca 
gagcaggcca 
ccctcacagg 
agctctggta 
aaacgaggga 
ccctagttta 
ccagcctttg 
catttttaga 
tgatgggctg 
acgcccgcaa 
aacaagtcat 
atggataaga 
tttctgatac 
ggaagaagag 
cgtgtgtggg 
gtgaccagga 
gttggtgctg 
agcagtctct 
gaagccctcc 
acccgccctg 
agagcgctgc 
gacgcgtccc 
aggtctccaa 
cgnnnnnnnn 
nnnnnnnnnn 
agatatagat 
gagggcatca 
actcaggacc 
tttttttttt 
ctatgttcac 
ggacttaaac 
accgccttgc 
ctcagtcctc 
attctgtgtg 
tgattcattg 
ggtctccctg 
catagagccc 
tcagagtctt 
aagacctgag 
caatgatgct 
cacagatgct 
cggaagatag 
ggttggttgg 
aaaaatgtag 
ctgaattttg 
ggccctttgc 
ttttagctta 
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138600 
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138780 
138840 
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138960 
139020 
139080 
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tgctcctatg 
aatgcctctg 
caggtgaatc 
gatatgtagt 
aaaaagaaag 
agaaagaaga 
tgttaacata 
tttaaattat 
tgctggccac 
cagatgtgag 
ctcttcgcct 
gtcgttcttc 
gaactgcatg 
tagagtcaca 
cagtgttctg 
ctgagtcatc 
caccacgctg 
ctgagtcatc 
ctgtgttatt 
cagagatctg 
attaaaagaa 
gggaagaagg 
acgtgagctc 
gctccacgct 
ggttcagagt 
actttccgaa 
tgtaatggtg 
cccatggaaa 
ctcaggctca 
ctgcagggaa 
catacacata 
catcagtcta 
atggagaggg 
tgcaatttta 
gtaaaagtaa 
tcataaccag 
tgtgagaatg 
ccactctttc 
caagtaattt 
agatctaata 
aatattttta 
gggtcaagcc 
tgaaagaatg 
tttgttagta 
aagagttgca 
cagccccata 
aacagcctag 
tgactttggc 
atccttgggt 
caagtgggcc 
tgagttcagt 
tctgacctca 
cactcgggag 
atttacagtt 
tcagtggtag 
agagagagag 
aaggaaggaa 
gaaagaaaga 
gaaagacaaa 
gagggcatag 
cctagcaccc 
actactgtgt 
agtaatctgt 
ctagcctaga 



ttagtaaata 
gctgagtttg 
tctgagttca 
aagactctgt 
aaagaaagaa 
aaatgagtgg 
ctactattaa 
tattttatat 
cacacataca 
acacgcctct 
ctacaatcaa 
ctggccagtt 
tgcacttggt 
ggttatgtat 
gatttaactg 
tctccagccc 
gatttaactg 
tctccagccc 
ctaactcttg 
tatgggctta 
tagagtgagc 
tcaggagttg 
tatgagaccc 
accacttgcc 
attgtgcaca 
aaatattaaa 
taagaaagta 
gttgcttaat 
cgagtgctta 
tccaaaaccc 
attttaaaaa 
ggctacacaa 
agatgcactt 
tagagagctg 
taaatgttac 
tgaagtaccg 
tgtttttctt 
ccttcacgat 
tccatagcta 
ttggcataat 
tgtacagcca 
tctgagtgac 
tgggaatgct 
ttatacattt 
aaaatatatg 
atctaccttg 
tatggtggcg 
tccgaaagcc 
acgtcactca 
aatgagatgg 
cttcagattc 
gtacacatgt 
gcagaggcag 
atataaagaa 
agcgcttgcc 
agagagggag 
ggaaggaagg 
aagagagaaa 
gcaaagcaaa 
atcagtgtta 
taccgcaaaa 
actttcacta 
tcttggtttt 
acacacagag 



gctaggtttt 
gtggcacttg 
aggcagcctg 
ctcaataaag 
agaaagaaag 
atttgccata 
caacaacaaa 
gtatggtgtg 
taggcagtga 
cctaggccta 
gctctcccaa 
gagcgcgcac 
acccagggag 
gggttctagg 
ctgagtcatc 
caccaagctg 
ctgagtcatc 
caccacgctg 
cttccttgga 
aaacaagaat 
ctaggagttt 
aaggctagtc 
tgtctaggag 
taacacgcat 
ccctgtatct 
gctacatcat 
tttggcttca 
gaaaacaggt 
ctgcccttgc 
tcttttggcc 
taaaataaat 
gaccttgtct 
cctattccat 
ctattttctt 
tttggaaaat 
ttaacatttc 
ttacaaaatt 
ttattgtacc 
actctgtgtt 
atttggcatt 
gtgttttgtt 
gctgaattgt 
catttcttaa 
atttaggggg 
gtgtggtttc 
ttaaatatac 
ttaacctcgg 
ttcctggaat 
tgtgcacaca 
ctccatgtat 
tgcaagataa 
catggtacgt 
gaggatctct 
actctgtttt 
tagcaagcgc 
agggaaaggg 
aaggaaggaa 
gaaagaaagg 
actaaataaa 
taatgcttac 
tatttgctct 
gcaacataat 
atttttcttt 
aactctcttt 



tagctagcca 
cctttagttc 
gtctacataa 
taaacagcca 
aaagaaagaa 
gcctagtcaa 
tactttacac 
ggaagctgtt 
aggttctttt 
tgtaagcagc 
taaacgtgtg 
aagagtatgg 
cccagaagag 
aatcaaaccc 
tctccagccc 
gatttaactg 
tctccagccc 
gatttggaag 
accctgagaa 
atgtccaact 
ggcacactcc 
agtcttagct 
aggagaaaga 
gacgcccaag 
ttaaaagtca 
ccactctaca 
gttttatggc 
aaaaataaga 
agagaccatg 
tctgcaggca 
cttttaaaat 
caagaagaaa 
gaagctgcta 
tcttttaaag 
atagagaagt 
tgcttctatc 
gggataatgc 
cattttctca 
ctgttacaga 
atgatgctat 
tgggatgaat 
tctcaggaaa 
gcaacactgg 
ataggaaata 
ttttcctgtc 
tgtatataag 
gttacttgag 
tccagctcca 
tacacatttg 
aaaggcagtt 
caggagagga 
gtgtagtatg 
gagttttagg 
gaaaaacaaa 
aaggccctgg 
agagggagag 
gaaagaaaga 
gaaagaaaga 
atacatacaa 
ctagcatgcg 
gtcttgctaa 
gagaatgttc 
gccaaattga 
atgaactcaa 



ccttgctagc 
cagcactcag 
caagttccag 
gaatgactta 
agaaagaaag 
actagatttt 
taaccatgac 
gcagagtggc 
gccaagacaa 
accagttctg 
cagaaggatc 
gttttcacct 
ggcatcatat 
atgtcctctg 
caccatgatg 
ctgagtcatc 
caccacgctg 
cagaactgag 
aattccttct 
cttgatctct 
tatgttccaa 
ttgtgacaag 
aaggggtaaa 
gtttccgttc 
atagtcaacc 
aacttgtaaa 
ctttgcatcc 
caggagggac 
gcaggcagtg 
accacattaa 
gagctctaga 
gaaatgaagg 
ttttggtgat 
taatgcttgt 
ataaagagta 
cggccagcca 
tgcacttact 
cagtattaaa 
gaagaaatgt 
aataagcatc 
tagcacaaga 
ggactagtta 
ttattattac 
tgctcatata 
ccctagttta 
gcatatgtag 
aactgctctt 
agggatgcag 
gtttttaatc 
atgcaaaggt 
ccaacccctg 
cacatgcaca 
ccagcctggt 
acaggggttg 
gttcagtcct 
ggaggagagg 
aagaaagaaa 
aaggggaagg 
tatattaaat 
taaaactctt 
attatattgc 
actatcccac 
tgggtgaaca 
gtttcttatc 



ttacaactag 
gaggcagaag 
cctggccagg 
aatattgcta 
aaagaaagaa 
cttggttata 
aagctatatt 
agttggctac 
gttaaccaat 
ggctcagggt 
ctgttgcagc 
acatttatgt 
cccctggaac 
gaagaacagc 
gatttaactg 
tctccagccc 
gatttaactg 
gctttgaaca 
attggctttt 
gattttatat 
gcttcaggaa 
tttgaggcta 
gtgaggtgtg 
ccagccctgg 
cttggaatta 
agcccttctt 
taaatgttac 
tgcaaagatg 
ggtctcccta 
cacatgcaca 
acgagtttga 
tttgctgtgg 
tatggtactt 
tgttttgact 
aaaaaaaaag 
gagtttttcc 
gttttgtagc 
ttttcagctc 
acttaattta 
tttctgtata 
gtaaagtgtg 
acatccattc 
atactattat 
gttcaatttc 
gtctcacttc 
gtgaataaaa 
gcagagaaca 
tgcctctggc 
ttaggaactc 
ctggatgaca 
cgagttgtcc 
gaagtcccag 
ctacaaaacg 
gggatttagc 
caactctgga 
gaaaggaagg 
gggaaagaaa 
aagaaagaaa 
tttaagactt 
ggcttctaaa 
tagttgtcag 
tcctctgtca 
agtatttcag 
cttttatgat 



140580 
140640 
140700 
140760 
140820 
140880 
140940 
141000 
141060 
141120 
141180 
141240 
141300 
141360 
141420 
141480 
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141600 
141660 
141720 
141780 
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141900 
141960 
142020 
142080 
142140 
142200 
142260 
142320 
142380 
142440 
142500 
142560 
142620 
142680 
142740 
142800 
142860 
142920 
142980 
143040 
143100 
143160 
143220 
143280 
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143400 
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143520 
143580 
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143700 
143760 
143820 
143880 
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144000 
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ctccagaggt 
tttggttggt 
ttttgttttt 
agaaactggc 
cacgtgccac 
ccagcattac 
ttaagagttt 
ccagaggtcc 
gatctgatgc 
taaataattc 
ggatcataaa 
cactgctgta 
cagaaggttc 
ttatctgaga 
tctcattcaa 
ctcttccatc 
gcagtggaaa 
acactgacat 
cacaaacaat 
ttccaatagg 
gattccaaaa 
caaagacttg 
acatagcagc 
cctctacagg 
aagatattaa 
gtcatcatgg 
agctgactgt 
gcattatatc 
tagtgcacgc 
gagtttacag 
taaacaaatc 
aggtccttaa 
ctggagttaa 
agaagaacaa 
actgaggcat 
gagctgctac 
tttttaaaaa 
aaaaactgct 
taaattttgt 
tcattttaaa 
tcatttttcc 
ggtttgatta 
gactgtagat 
ccctatattg 
gagttccttt 
gtcatctctc 
tcacacctag 
aatctcacta 
actcacagag 
aagcccagca 
ctattgagcc 
gacaggccta 
agggttaaag 
taataagaca 
cactcaggag 
aatgggaggg 
tgtaatttaa 
tcatggtttt 
tccttttgtt 
gaaagactgc 
ctccaggagg 
ctggtgcttt 
caagaaaagg 
atagccctgg 



ttgtttttgt 
tggttggttt 
aatttaatgg 
tttgaacttc 
cacatctaac 
tttgactctt 
gatccagggc 
tgagttcaat 
cctcttctgg 
tttaaaaaaa 
ttcaaggtga 
ttcatttcct 
agtattcttt 
tgcagagtcc 
ggcctaacac 
tgggtcagta 
atgagctcct 
ccacccagac 
atgtagttga 
ccagcacttc 
tgtgtccaga 
aaagatggct 
tcacaatagg 
cagaaagcac 
ataatagccc 
gcagtgtaaa 
gagtggcttt 
cgtggatttt 
ctttaatccc 
agagtgtcta 
agatctgtat 
ggacaggaaa 
ggactgttgg 
gaatctatct 
gctatgtgtt 
taaaacctaa 
aagagtgggc 
gctgctgctg 
tgtatacttt 
ttggcactgt 
atttatacag 
cagaaactat 
ccaaaggaca 
atcatctctt 
catcacttta 
tttcagatga 
cctcttactc 
tgtagttctg 
atctgatagc 
aaatttacct 
agtcatggtg 
tgggttcggg 
ttttgagaca 
aatcagacat 
gtgagatcaa 
gaggaggaag 
ggcctaagct 
ctgtcttcgg 
ttcaggcatg 
agcccaagat 
caagacagac 
cctcttacca 
agactctccc 
gggaagagaa 



gggtttatta 
tgctttactt 
actggttcca 
tggtcttcct 
aatatctggg 
catagtttct 
tggagagatg 
tcccagcaac 
tgtgtctgaa 
aagagtttga 
gcctgggtga 
cattgatctt 
ttttcccaat 
cctttattcc 
tgaggacatt 
tgaagtgagt 
ctctgttctt 
ccattctctc 
caagaagtag 
tgtgtccata 
tattacctcc 
tgatggctct 
ctataacgtc 
agacatacat 
acgcttgagc 
agtaaaggtg 
cttctcatcc 
aagttaggaa 
agcactcagg 
ggcagccagg 
ttcaggaaga 
gtattccaca 
aatagttgaa 
gggcatggtg 
tgtggctaac 
aaaacaaaaa 
agaatgtctc 
ctgctgcata 
agctgtgctg 
acagaaatta 
aagcaatgta 
caaagtattc 
agtggagaaa 
ttacttctaa 
gatatataat 
gaaacaaaga 
actccatgaa 
gttgtcctag 
ctctgcctgc 
tcttaatttt 
agacaggctt 
gcagcctgat 
aaactttgtg 
aatattataa 
tccccagcat 
gaagagagaa 
ctcgactgaa 
tacatagtga 
tcttaaagac 
gatggtgcgt 
catcagtgat 
gattctgcaa 
agaacccatc 
cacacagatt 



gtgttttttg 
tttcttattc 
tgtggccaag 
tcatctacct 
tttctttatt 
tccagagtta 
gctcagtggc 
cacatggtgg 
gacagctaca 
tccatttaca 
actggcagaa 
cagaggtttg 
tctattcagt 
tgtgaccatc 
tcactgtgct 
attgaagaac 
ctgcacacat 
ttctgactca 
ctatgtcatt 
tcagctagat 
ttcgtttgtt 
tccagaggct 
atttccaagg 
gcaggcaaaa 
ttattcctct 
ccccgccctg 
cctaactgct 
atgacaaaga 
aagcagaagt 
gctatataga 
tgcctatgag 
tgtgtgcatt 
tgttcacact 
atgcacaact 
ctgggctaca 
tctaataatc 
tgattttgtt 
gcacacagct 
tcaactttgg 
acagccatat 
ctgtattaaa 
tctaaatgat 
aacttaggaa 
taatagtggc 
ttatctcatc 
cattgaaaaa 
tattcctttt 
aactcaatat 
cgagtgctag 
ctaagacgtt 
aatctttaat 
ctatagagag 
tcaaaaacaa 
acaagtatta 
aatgagggga 
aaaggaaaga 
gttgtccact 
agagtggaaa 
atagtggaag 
aagacagaat 
ctcgaaagca 
aggaatgtcc 
cctgcagctc 
tctggggcct 



tttggttggt 
atttttttat 
gatagcctca 
cccaagtgat 
ggagtttgaa 
ttacactttc 
taagagcacc 
ctcacaatcg 
gtgtaatcat 
ctggacttct 
gtggcagaag 
ctaacgggaa 
ctttagtagt 
tcgttgtttt 
atgccaatcg 
ctggtgtcct 
tgaaatcaac 
tgtcacctct 
gtccacagtg 
gtgctgctga 
tcttctaaat 
gggtttgatt 
ggtctgactt 
cacataaaca 
gatgatacag 
cccaggggca 
gcaacatcat 
ttagatctat 
aggtggatct 
gaaattctat 
ctgtttgaca 
ttacagaaat 
cagtgttctt 
gtattcctaa 
tagtcaatat 
tgggatttta 
cagatggcca 
aatatttgac 
aaaaaaagta 
tagtctagac 
tattcagtct 
gaaaaaagat 
gaattttccc 
catttattga 
ctaaaatgac 
tctaacttgc 
ttttttttcc 
atagaccagg 
ggttaaatgg 
tctcctctta 
ctcagcactt 
agttctatgg 
acaaacaaag 
gtgtcactca 
ggaggggaga 
tctcaaagca 
tttaatgacc 
ccaagggtca 
caggtatgca 
gcctggtgaa 
gcctcactgc 
agcatctgaa 
tcaccaatgc 
ccctctctgt 



tgtttgtttg 
ttctttattt 
actttgtagc 
gggattaagg 
agggattcct 
atttgttaca 
aactgctctt 
tctgtaatgg 
ataaataaaa 
tgaggcagca 
ctgtgtctca 
gtaagtggaa 
agatccctca 
tcagggagtg 
ctctgcagcg 
gcctgtggct 
tagcttgcaa 
cataggtgac 
catggatttg 
tagtatttta 
aagccaggca 
cccagcccca 
cctgttctgg 
taattgaaag 
ctctcctgaa 
tcagtgaggt 
caactgtgga 
ggccaggcac 
ctgtcagttt 
ctggaaagaa 
tgtgtgatag 
tggttataca 
caaatcaaat 
catgtagaag 
tggacagtca 
tattttcctt 
cagaacctag 
tatacgtata 
tcccagttta 
acattaaact 
tatctacagg 
tcaagaatct 
tttatcccct 
acatacccag 
ctgttgatga 
ctgcataaga 
tttgagacag 
ctagcctcaa 
atgtgtcacc 
aaaaatggaa 
aggaggcaaa 
gttaaagttt 
ccagactgct 
attaaaaagt 
gaaggaaatg 
gaacacagga 
cttttcatgc 
tcctggaatt 
tgggtatcct 
cctggggaga 
caagccaacc 
aataaagatc 
cttcgaggat 
cagcttatgc 



144420 
144480 
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145200 
145260 
145320 
145380 
145440 
145500 
145560 
145620 
145680 
145740 
145800 
145860 
145920 
145980 
146040 
146100 
146160 
146220 
146280 
146340 
146400 
146460 
146520 
146580 
146640 
146700 
146760 
146820 
146880 
146940 
147000 
147060 
147120 
147180 
147240 
147300 
147360 
147420 
147480 
147540 
147600 
147660 
147720 
147780 
147840 
147900 
147960 
148020 
148080 
148140 
148200 
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acacaccctt 
ctggtgaagg 
cattgccttg 
tgtctgaagc 
agccaggaat 
gggctgctcc 
agatttgaag 
acctgtttac 
agtgaagaga 
tatggaagta 
gaacacagat 
agcgtgcagg 
gcagatttga 
gcccaggcaa 
attggtattt 
aataaatcag 
agctttgagc 
tctaagacag 
tctcttaaaa 
caagaggcaa 
caggctacat 
caatagcaac 
gcttttttca 
catgtatgtg 
gccactgagc 
atttattcat 
ctaccatgtg 
tcctgaacca 
tattatatgt 
tagagatgat 
gcagtcagtg 
tggtttccat 
gaacttggtg 
aatgatggtg 
catactacac 
tggagagtga 
aagaaaagtc 
tttctccccc 
gccggaggat 
aggacgacat 
atgttttgcc 
gggcatcaga 
ggaatcaaac 
tttattacat 
cccccccccc 
tctagttaat 
cagattaata 
tacactgtcc 
tgccaattta 
agcagtagga 
agttagtctc 
cctctcaggg 
gtgagtggaa 
atgggtgaga 
ctctgacaat 
ttattggcag 
ggacagaggt 
cccaggaatc 
tttcaccaag 
gagaaggaga 
tcaagacact 
tctgccttca 
tggctcggct 
ggtttggttt 



tgaaaggccg 
aagatgcttt 
agaacaagtg 
acactttggc 
gtatgcagca 
tcacactcgg 
atgttgatag 
ctgaaagcaa 
agagcaagat 
attataatgc 
tcatctgtgc 
cccaaacctt 
ctgtttgggg 
taacatccat 
caagagctaa 
tgacctaaaa 
acatttctgt 
ctttagctat 
acttgagcca 
aggcaggcag 
agtaagatcc 
aaattaattt 
gacacaccag 
gttgctggga 
catctctcca 
tttgtgtgtg 
tgtcttggag 
tctccctagc 
aagtacactg 
tgtgagtcac 
ctcttaactg 
tttatccaaa 
tgtgagcatg 
gcctgcctga 
actagctgag 
tcatctccac 
cttttaaggt 
tccctcattg 
ctttgtgaac 
agatataccc 
tgcatgtatg 
tctctcagaa 
caaggtcttc 
attttcctca 
ccccgagcag 
taagctgaaa 
ctctgttagc 
agattttgcc 
tacagtgttt 
tttgaaagct 
tgtggtacac 
tgtccagaag 
ttctgcagca 
gggccttctc 
cttagcctgt 
taaatcagag 
cattggctga 
tcaggtgctt 
gttatgtatg 
agccctgctg 
gatagacttt 
ttggctcttt 
agtaaaggtg 
ttgttttttt 



ccatctagtt 
tgtaagtgtc 
ggataccaga 
cacagtacct 
ggcatgggac 
ggtcttctgc 
agttgtaagg 
gaatctggtc 
aggtgaaccc 
tatcttcagc 
catcagtgtc 
aaagtcccct 
agcagccatg 
atgtcacaca 
tgtttaagga 
ggaaaacaca 
gaacccagag 
acagcacgtt 
gagccaggtg 
atccctgaat 
tgtttcaaat 
tttagatgta 
aagagggcat 
attgaactca 
gtccattaat 
gtatacatac 
atcaaactca 
catttctagt 
tagctgcctt 
catgtggttg 
ctgagctatc 
gatgccttta 
aagaccagag 
gattccagga 
ctgtgtgctc 
aggcaagcac 
ggtggtggtg 
tgatggcaca 
tggaggtcag 
tgtctcaaac 
tctatgcaca 
ctggaatttc 
tggaagagca 
attacatttc 
caagtattct 
gggagggagg 
ctaactacac 
atcctttttg 
ataggtattc 
gacatttatg 
attttggggt 
ctgctgcaca 
tcccggtcca 
caagggcctt 
gcatagtttt 
gttatccact 
aggataacgt 
gctgactggg 
gcagagggta 
ataagggaag 
gtccttaatt 
gagccactgc 
tttgcctcta 
gagacaaggt 



gccacaaaag 
cttatcccag 
gttaccagtg 
tgtggcagct 
ctctaccata 
catgttgccc 
atgctttgtg 
aaggcatttg 
ccaattcctg 
cttttgcccc 
tccgcactgt 
aagctgaaag 
ctgcgacaca 
ggtaagtcag 
gaaaaacact 
gccgtcttat 
cttgggaggt 
tgaggtcagc 
gtggtggcac 
ccagcctcgt 
aaataaatat 
tttatttatt 
tggatcccat 
acacctctgg 
taaaaattta 
tataatacag 
ggttcttagg 
attcttttct 
caaataccgg 
ctgggatttg 
tccagcccca 
agggcctggg 
ttcagatccc 
caacggagac 
aagagagcag 
acacctgagc 
ttgttgtttg 
ttcctttaat 
cctgttctac 
agacaaaaat 
ttatgtctgg 
agacacttat 
gcaagtagtc 
caatgctatc 
tcattgctgg 
tagatgttgc 
tatagaagct 
ngtgtatatg 
tttgtgacgt 
tggcctatgg 
ctatctgcgg 
ctgggctgga 
gctgggagtg 
cttagtgtta 
ttattcgaga 
cttagggaag 
actgagatgc 
tttcttaggg 
taggggtttc 
tcccccattt 
agcagggccc 
tacaaaatat 
agcctgaagg 
ttctcagcat 



acattctccc 
gagaaatgcc 
gagacctcta 
gcagctatgc 
gcacagagtg 
tgagaatgac 
acgaggttgg 
attacacaag 
gatgcaatgt 
atactgaaaa 
gcagacaact 
cagtgacccc 
tgctacagct 
aaatggtttt 
ataaaggaag 
atatcattat 
ggagatagga 
ctgaactaca 
atgcatttaa 
ctatatagtg 
aacaaaaaca 
ttatgtatga 
tacagatggt 
aagagcagtc 
aaactagagt 
gttcataagt 
catgggagca 
tttgtcttga 
aaaggggaat 
aactcaggcc 
tttttagtat 
aagatggctc 
tagcacccag 
agacaggggc 
ccctggctta 
acacagacat 
ggggtctttt 
ctcacatctg 
atagcaagcc 
tatttatttt 
tgcccatgaa 
caagtactgc 
tttttttttt 
ccaaaagtcc 
gccatctccc 
ccaaacttag 
tattctttag 
tctacagatc 
ggatctttta 
tcctgttaaa 
ttccgcatct 
aggatgaagt 
aatgctgggg 
cagctctagg 
tgagcttgta 
gagataggaa 
tcattagcac 
ggttgagagg 
aggctcccca 
tgagccactt 
aacaagtgtc 
ccaaatctgg 
cctgagtttt 
agccctgggt 



aggagaactg 
acgacctcat 
ctgtcaccga 
caagtattgc 
ttctcttggg 
tcttttagcc 
tagcacagac 
tcagggagag 
caatggaaag 
gcatagccca 
caaagctgac 
agggctgtgt 
gcagtgtaat 
tacttacatt 
cctggcatca 
gctattgaga 
tgattaggag 
tgagaacttg 
ttctagtact 
agatccccac 
gcaataataa 
gtacaccatt 
tgtgagccac 
ggtgctctta 
atttttaaac 
caattctctt 
agtatttact 
aagatttatt 
caggtcttgt 
ctccagaaga 
tcttattaag 
agtgggcatt 
gcagatgctg 
cctagctaac 
ctgtgcagga 
gcacaaagga 
gttttgtttt 
ggacaaagag 
catttcagcc 
atatatttga 
agccagaaga 
ctgagtgcta 
ttaatatttt 
cccataccct 
catctccttt 
gatttattga 
actttcacat 
ttaattcagc 
cccatcttaa 
tcacatttca 
cacacttttc 
ggagtccaga 
tcaggaggag 
caaaggcctt 
tgcatacact 
tacccaaggt 
ggggaggcat 
gtagcagtga 
gacaaagaag 
cagaaggcta 
tgttttcttt 
gccaggaagc 
cacttgattt 
attctggaac 



148260 
148320 
148380 
148440 
148500 
148560 
148620 
148680 
148740 
148800 
148860 
148920 
148980 
149040 
149100 
149160 
149220 
149280 
149340 
149400 
149460 
149520 
149580 
149640 
149700 
149760 
149820 
149880 
149940 
150000 
150060 
150120 
150180 
150240 
150300 
150360 
150420 
150480 
150540 
150600 
150660 
150720 
150780 
150840 
150900 
150960 
151020 
151080 
151140 
151200 
151260 
151320 
151380 
151440 
151500 
151560 
151620 
151680 
151740 
151800 
151860 
151920 
151980 
152040 
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tcactctgta 
gcttagatta 
gatggctcag 
gcaaccacat 
ctgaagacag 
aagaacacta 
ttcattctag 
gaaaggccac 
ttctacctgt 
gatggttgtt 
gccaccacat 
agttacccat 
gcacatattt 
agcccagttt 
aaaaacaaac 
ttgggcgtgt 
ttgggaggca 
gaaaccctgt 
gttaatatga 
ggagacattc 
cnnnnnnnnn 
nnnnnnnnnn 
aaccgtcagg 
attaaagcac 
gagaggcaca 
ctgggagaag 
agcagctatt 
tctgcagggg 
ttgtgtctat 
gtgtgtgtgt 
gtcagtgctc 
ggccaggctc 
taggaaataa 
aattggattc 
tatgcagagc 
gactgggctt 
tgcccgtcct 
gctactgaaa 
cagggccctc 
ctcctgaggc 
ccatccaacc 
agaaacctct 
gctaatgtct 
tgtgttgcat 
tgattcttgc 
gcccagggac 
tttttttctt 
tagtttataa 
tttatatatt 
taagtggaca 
aaatgtcatc 
ccgggggctg 
gtggcatagg 
tctgagattg 
tccaaaacca 
agcttcttca 
cactgtgctg 
atacccttcc 
cggtggctga 
aaaaaaagaa 
agtgctgggt 
ggccaggagt 
acggatgggg 
gtcaaagctg 



gatcaggctc 
aagttgtgcg 
tgggtaagag 
ggtggctcac 
ctacgggtgt 
ttcacaaggt 
gggaaaaatc 
agcacgtgta 
ttttgaggca 
ctatttctgc 
aagcatggca 
caagccatct 
aatcccagca 
acaaatcaag 
atccttctgt 
caaaaatcac 
gaggcaggag 
ctcaaaaaaa 
aatagcagaa 
catactgaaa 
nnnnnnnnnn 
nnnnnnnnnn 
ctcagaggca 
caaatcttgt 
catagcttcc 
cttgaagctc 
cagcagctgc 
aaagacaagg 
gtgggtgagg 
gtgtgacaga 
cctttggctc 
gcccacccca 
aaaacagggt 
agcatttctt 
acaagagact 
gcctgcaggc 
ggggtcatcc 
caccctcatc 
agtaggtcac 
cagagctcag 
cattggacta 
tccccagctc 
gcagcttcta 
ataaaaacga 
tgcttctctt 
tattgttgct 
gctagaagtt 
acatgataag 
tatatatata 
taggtataaa 
cactggcgat 
cctgagacgt 
gttggacagg 
ctggttgccc 
gcagggtcct 
aggggcgcat 
gagtggatac 
cgtctacggc 
aggcttcagc 
agaaagaaag 
gacggagccc 
ggggccccaa 
gtcctgcgag 
aacttgtgct 



aacttgaatt 
ccaacactgc 
cacccgactg 
aaccacctgt 
acttacatat 
acaacaacag 
tttttaaact 
tccaggttag 
gtctcttgtt 
ctctcatgtt 
ccagcactga 
tgctagcccc 
ctcaggaggc 
tcccagaata 
gttccagtca 
aagtagcaag 
gatctctgtg 
ccaagcaaaa 
ctcaagatag 
tatatgcaga 
nnnnnnnnnn 
nnnnnnnnnn 
ggatggcagt 
actaaaaaca 
gggcccttgc 
aagaactggc 
ggctcttggg 
agatggcact 
gaagtgccgt 
gacagaaagg 
ctcctgccaa 
ccaaagccca 
tcttgtccaa 
actcctcagt 
gtgcttgcca 
tgccttctgg 
aggaggagca 
tgctaaggcc 
tggccaaaga 
agcctccctg 
acactgtaga 
cgcctagctt 
cagtaggggt 
ggtgatgttc 
ccttcccgac 
ggctggggtt 
accgatacag 
acatacagtt 
tatatattat 
actgcacctt 
ctctcctttg 
caggaatgag 
gcaggttaac 
cagctggctc 
tcagcccttc 
gtcccgacac 
ggtccagagg 
tgtacatttt 
agccttttcc 
aaaagaagaa 
tgggtcctca 
gggccgtgcc 
cacacttgcc 
gatagtcggg 



cagagatctg 

ccacctaaaa 

ctcttccgaa 

gatgagatct 

aataataaat 

tgtactagga 

ttagtgtgta 

gagtcaactt 

tctgccctac 

gccctatgca 

gccaagggca 

agaaaatgta 

agaggcaggc 

gccaaggcta 

caatgactgc 

gtgtaatggg 

agtttagcac 

atagaattac 

ctaatgaaac 

gcctgatgct 

nnnnnnnnnn 

nnnnnnnnnn 

gatgtgtttt 

gccctctgct 

cctctgctct 

cgctgttgcc 

caagcttgga 

gagctgggta" 

gtgtgtgtgt 

gagagtgcat 

aaagcatgct 

ggatctgcca 

gagaattttt 

atcctctctg 

gaaacccttg 

gctgacccct 

ggaccactgt 

actagatttt 

gcctagatgt 

gcctgcacaa 

cgctgccttg 

gcctccagca 

gacggggtgc 

taagtatcta 

tcttcccact 

ctgaacaaaa 

tccttaattg 

tggtaaggag 

gtgggtgtgt 

ctgtgtgact 

gtgattggtt 

aggcattacc 

taagtgatct 

tgggagagcc 

tctccaaatg 

cactaatgac 

cgctcggtca 

gggcttataa 

tgaaacccag 

aaaaattctg 

gcggagagtg 

tgaggatgcc 

cttcctcttg 

ggcatagtcc 



cctgcttcta 
aaaatatgag 
ggtccgaagt 
gatgccctct 
aaatcttaaa 
aactttaaaa 
agaaagagag 
ttcagaagcg 
actttgtaca 
tgctgagctt 
ggcatctggc 
tttttgacag 
agatctctgt 
catagagaaa 
tgtaacaata 
caggccttta 
agccagggct 
aagttaacca 
aaaggattct 
tgcctagcaa 
nnnnnnnnnn 
ngcctgggca 
ggggcacaag 
ctcgatctgc 
cccggtcagt 
tttgtgctga 
agtagctcgt 
gccaggatgt 
gtgtgtgtgt 
gttcttgtgt 
gtgttctcag 
ccccattaag 
attattattt 
gtagggaatt 
ggccaacagc 
gagtctggcc 
gatacagggt 
tttcccactg 
taaaactgca 
agtgctaaga 
tctgccagct 
ccctgacatc 
tccggggtgc 
agaatgttgg 
gacacctttg 
ttgccgcagt 
agcaaatata 
tgggttgggg 
gggcagagtg 
ctcattgcga 
cttggacccc 
cccatggata 
cagacaagag 
ttgttctgag 
accaggcttc 
tcactttgcg 
ggacagccga 
tccaccagga 
cagatcttcc 
ttccttgctg 
actgccagcc 
tgctgcaccc 
ggtcgtgcgg 
tgcagttcgg 



catcccgagt 
gggctggtga 
tcaaatccca 
tctggtgcat 
aaaaaaaaaa 
tagcccatca 
aggggctgta 
gagtctcccc 
tgaacttcaa 
actgatgcca 
tcacatggag 
gtgtggtggt 
gtctgaagcc 
ccctgttttg 
atatgaggat 
gtctcagcac 
gttacacaga 
gggtattggt 
attttataaa 
ctgaactaca 
nnnnnnnnnn 
agagtaagca 
gccccctttc 
agagatacga 
tcctggactc 
cgccagaacc 
tccttctcct 
gggaagtaaa 
gtgtgtgtgt 
gttcctgtga 
agtcaagctt 
aatgcggggt 
tttctctctt 
caggtctgtc 
cctattccct 
ctctgacctc 
tcctgagatg 
ccagactact 
gctaaagcct 
aggacattgt 
tgaaaacccc 
cacttctctc 
cagacaggcg 
tcccctgaag 
cccccagcaa 
tttttgtttt 
ggttcccgta 
gacctgtgca 
aggatatata 
gtacagttct 
agcaggtctg 
gggactgagg 
ccaagagtgc 
tcctgctcct 
cgcagagccc 
tgcctttgac 
gtgagacgtg 
agggtccaga 
acttaggaaa 
gacatgtggc 
ccagtatcca 
cactgggggc 
tgggcatgat 
taagctcttt 



152100 
152160 
152220 
152280 
152340 
152400 
152460 
152520 
152580 
152640 
152700 
152760 
152820 
152880 
152940 
153000 
153060 
153120 
153180 
153240 
153300 
153360 
153420 
153480 
153540 
153600 
153660 
153720 
153780 
153840 
153900 
153960 
154020 
154080 
154140 
154200 
154260 
154320 
154380 
154440 
154500 
154560 
154620 
154680 
154740 
154800 
154860 
154920 
154980 
155040 
155100 
155160 
155220 
155280 
155340 
155400 
155460 
155520 
155580 
155640 
155700 
155760 
155820 
155880 
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cccagagctc 

cttcttgtaa 

ggccccatgg 

agacttgatc 

cagcttcaga 

gccacggaac 

attcccattg 

ctcagtgagg 

cttgtggtga 

gccccggaag 

gagattgacc 

gtagagatac 

acacttatag 

cgttcgcagc 

ggtgttggga 

gctgaagtgg 

gtcctctggg 

gggcgcaggg 

gagcagcaac 

ggggctgccc 

aatgactctg 

cagggcattt 

tctctaattc 

ttgtgaacac 

tagctgcctt 

tatttaaata 

aataagcgcc 

gtttcctcat 

tgaggggctg 

tgccagatcg 

attgccaggc 

tgtagcttcc 

attttcggat 

ctcattatgg 

ccagcaggcg 

ggccaggcag 

gttcagctgg 

ttcttgctgc 

cccacttggg 

gacttctgac 

gctgttggga 

ggcttcagct 

aattgggttc 

taggacccct 

cggcccttag 

tgtctagatg 

ccagcctctg 

ctgccctgtg 

tccctgcttg 

cggtcccctg 

tttatgctgg 

cttttcttcc 

accattcgac 

ttcccctgtg 

ggaagtccct ■ 

gctctattta 

agtttataaa 

actggggttt 

tgcagaacca 

caatggacca 

ctggaagcat 

tccacatgcc 

tgaggggctc 

ctgagcaagc 



accttagaga 
cctgacctgg 
gacggatggt 
tggtgaggag 
tcctggcctt 
cttcgcagcc 
aggcgaagga 
ctgttgttga 
acccactgta 
atgccttggc 
aggtccacaa 
tgcaggctgt 
aggtagaggg 
tgtcggttgt 
gcaatgaaag 
ccctgctgga 
atgccctccg 
tagcacacac 
agcagcaatt 
agaggaggtg 
ttggctcaca 
ggggccaagg 
catccgaaca 
aggcgagtcg 
agcaaggaat 
attcctcaaa 
tctttcacac 
gagttaaaca 
tgagtgacac 
aggctgcatg 
tgtggctgta 
ctctaattgg 
gtgtccagcc 
acgattgtca 
cgtcaggaag 
ccacgaccta 
gactcttggg 
tagctgtgct 
gtcccaggcc 
gccccccccc 
gttatgggct 
agaatgtttc 
agttgtgatt 
ttctgggccg 
acagcatgca 
ttactctgtc 
tcctagggat 
ccaccagcac 
gcttgcaccc 
ggaaagcctg 
atgagacaca 
ccggcctgca 
atttgtacag 
ttctcccacc 
tggggagatt 
ggaaaaaaaa 
cataccgtgg 
tccctctaaa 
ggctcagttc 
caaggacctt 
actaatagga 
agggatgtta 
agccactctg 
tgtcctagtc 



tctggttccg 
agccaggcgg 
gctccttgcg 
acactggtcc 
gccgcagctc 
attcccacag 
actccaaggc 
agagaaagag 
gctggttctc 
ccaggctcca 
agatgtcatc 
gcaggccacc 
cgtgaagctt 
ctccaaggtc 
tgatgttgtt 
ggaaggtgat 
ggatggcagc 
agtctcgagg 
ccacacagca 
gagatagatg 
gaggttctgt 
cccacattat 
cttcttcaga 
gcctccggct 
gactctaaca 
ataaagaaag 
tgtggtacaa 
cagttcacag 
cagggctgac 
ctcctctccc 
aaaccgggcc 
gcattaatta 
tgtggtcggg 
ttacctgcct 
atggggacag 
tgtcgcctca 
agagccaggg 
cagagcaagt 
cagggactgc 
cattcccaac 
cccagagagt 
tagcacataa 
tcagagcaat 
ggccatgccc 
ggctataacc 
gatcttcctc 
cctgtgctgc 
tcacactggc 
tctcttccag 
gactctaggt 
gcagagcctt 
gctgcaggca 
ctcatcggag 
tactgtggct 
acccccctta 
aatcaaactg 
caagtggagg 
agggcaatgg 
ccctgttatc 
gtccttgtag 
tgtaggctca 
accttccaag 
catccatgtc 
agccctggcc 



gttcctgtgg 
atggccatgt 
ggcagccctg 
tgtgcagttc 
gggggtcgcg 
ggaacgtgcc 
caccaggggg 
ggtggttagc 
atgcagcagc 
tagcttgtta 
ttggaggtac 
aaagatgcct 
caccaggcct 
tagctcctcc 
ggagtagatc 
gcgattgttc 
aaagttgtgt 
acaaccacca 
ccctggtggg 
gggaacagag 
cctctgtatt 
ctcctcacct 
ctgtcagccc 
ctgggtccgg 
aagcaaatct 
cattgagtcc 
aaacttcaaa 
ggctgtgtgt 
gcacgaggct 
ccattcacac 
ttgcctcttg 
ggcattcgtt 
taataggcct 
ttttccaggg 
gctccaggcc 
gttggcctct 
cccctggggg 
ggaaggagca 
ccaggccccg 
agaaacagca 
ggcagctgct 
aaatcatcgc 
tatgacctca 
tgcaggccct 
atctctgcct 
ctcagcatcc 
agagggaggc 
tgcacagtcc 
gaaggccatt 
tcaaggacat 
cttcactggg 
ggtaggggtt 
tttacagagg 
gctctgtctc 
cataggggcc 
aaccaaacct 
aggcaggcgg 
ggggacaaag 
tcaaggctat 
gacagtcact 
ggacagctgg 
cctccatctc 
tttgaaagcc 
acttctggac 



ctggtgcagt 
gggtggcctt 
tcagaggtgg 
cggaagtcct 
caggggacag 
cggcagccac 
gccagacagt 
ctgtggaggt 
aaccggtcca 
ccatggagaa 
tcgatatggt 
gcgggcaggg 
tggaaggtct 
agatgcacaa 
cagagggtga 
tgcaggaaga 
gcctggcagc 
cccagaggta 
gagagacaga 
ggtggaatgg 
gtatgcaggg 
ctttagctct 
caccgcaggg 
ctctgccact 
ggagtcctga 
atggtaccaa 
cctacaactc 
acagacaagg 
tcccttgggg 
ccccccctcc 
actgtccaga 
aatggatcct 
atgctcatta 
tcacagctgc 
tacgggcgcc 
tgccccttct 
aatatgagct 
gggaccttct 
gcagagtagg 
tcgtaagttg 
tctcgtccct 
atataattta 
gcagcagggg 
ctgcagtgct 
caatttcctg 
tgggtgtggc 
acagtcggag 
acagacccac 
cttgccagaa 
tcatgatgct 
gggtcctgtg 
gcagtgggct 
gcttttgttg 
tgtgcacccc 
tccaaggaat 
caggtgtgga 
cagaacggca 
ggactctagc 
catcactagg 
tcctgcagtg 
tctctgtcct 
ttctaatggg 
agtggtatta 
tccttgcctg 



tcttgcctgc 
tgtccctgga 
taagcgtgtg 
ccaccctcag 
cagagctaga 
agtcccaagc 
caccctgcag 
catggaaagc 
ggttcaccag 
acaagtgact 
tgtcctgcaa 
cgctcagtcc 
cgggtgccag 
agccctcgaa 
ccatggcggg 
tgcgctcact 
tgacagtcat 
gctctccagc 
acagcagtga 
gggactggca 
gtcccttgga 
gtccctaaag 
tggaacacac 
cgctcactgt 
atgatcactt 
agcatgcctc 
ctccatggct 
cacatttctg 
ttcacagtac 
tgtgctggag 
gcatttcctc 
taaaataatt 
tggaagccgc 
cccaagtggc 
caccctgaag 
tttccagctt 
gagctgaatc 
gaccaggctt 
ttgccacctt 
acagctccca 
gtaatcaccc 
gtttttgcat 
tggcacagca 
ttctgcccac 
ctccaaaacg 
ctccagcctg 
ggaggggagc 
agctccaacc 
cctttcccaa 
tgccccacat 
aaaatgaaag 
tatcactaat 
taccctcaac 
aaaagaatct 
acaggctaca 
cttagtaacc 
tgaaggtagg 
caagacctga 
aggcctaagg 
aagtgctctt 
ttagtatttt 
gggggggtgt 
ctccaggacc 
agtcagtagg 



155940 
156000 
156060 
156120 
156180 
156240 
156300 
156360 
156420 
156480 
156540 
156600 
156660 
156720 
156780 
156840 
156900 
156960 
157020 
157080 
157140 
157200 
157260 
157320 
157380 
157440 
157500 
157560 
157620 
157680 
157740 
157800 
157860 
157920 
157980 
158040 
158100 
158160 
158220 
158280 
158340 
158400 
158460 
158520 
158580 
158640 
158700 
158760 
158820 
158880 
158940 
159000 
159060 
159120 
159180 
159240 
159300 
159360 
159420 
159480 
159540 
159600 
159660 
159720 
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tgccaatcct 
atggcgcctt 
attaatgtac 
aggaatccaa 
cttgggcagg 
cttcccctcc 
tcccacgcac 
acttgccccc 
atcatgcccc 
gaaattaaga 
cttggtcact 
tttgaactag 
gctcagacct 
tgggtcagct 
catggtccca 
atgtgcaaag 
agcctccaag 
accagaacca 
aggggccctg 
gtctttaatg 
agcatcagtg 
ccctgccttc 
tcgaaatggg 
agcacccaag 
ggtcagtggg 
ggccttcagt 
tgttaaccga 
ctggggctgc 
ctccttcctg 
ctcagcctcc 
cagggctgtg 
agcctcgttt 
cccacagata 
ccaaaccaag 
taaagatggt 
ctctggggaa 
actcgctatt 
gccctgcctc 
agtcaggaag 
ccccaggtct 
ccgtccctcc 
tacctggacc 
agcgatttgc 
acagcacgac 
tgctccctgc 
gtctctggag 
tcatagtagg 
ggagaacaga 
ccgtttgttg 
aaatagactg 
atgctaaaga 
cagaaatgtc 
aggtgcttgg 
aagtgagttc 
taaaaatctt 
acacacacac 
gggctgaact 
tgtctgtgta 
tggatggaga 
actcacatct 
cccaggccca 
gcccagcatc 
agcaccccag 
cccaggccca 



aggattgtca 
ctatgccccc 
caacaatgag 
gtctctgtgg 
gctgccttct 
aggaggtgcc 
gtgagctccc 
tagtctagga 
atttgcatat 
gaggggccta 
gatcccctct 
ggctgagaag 
cagcacacac 
ggcaagcccc 
tgggaccaga 
gcactggggt 
gaatactgta 
gcatcaacag 
tagcaccctg 
aaaagccctt 
ggctggttgc 
aggctggggc 
ctgggtcagc 
ggggctgccc 
tctctctgaa 
cctccctccc 
atgctctctt 
tggcgcctca 
gctcggccct 
actgctgtgc 
gagccagagg 
tcctcacctg 
ctggtaccca 
ctcttctctt 
ccctttacat 
attctgctgg 
gctaccattc 
ctcaccccta 
ggctgctcct 
ttctcccagc 
cacatgccag 
ggttcccagt 
aaggagcttc 
ataagtcact 
acacagtagg 
aattgaatga 
tgctcgataa 
cacctgaact 
gatgagcact 
ggactgggga 
aaaaaaaaag 
tgggtttatg 
gaggcaaaga 
taggacagcc 
agtgggggag 
acgttagtat 
cagagtgtta 
gatgtgtacc 
catggttgtg 
acctctggag 
gcaccccgac 
ccacgcccag 
acccagcacc 
gcaccccagg 



ccagcaggtt 
tctatgaggc 
gatggagcct 
aggggccctg 
acttgagggg 
cttcatcgtg 
aaggggctct 
gagccatgga 
gcggtggcaa 
aggaaagcct 
gtaacaccag 
ggtctgctct 
tcaacagaag 
aaactcttcc 
cctggtagga 
gggaaagggc 
acctcagctc 
ggcctgtctg 
gccaccagaa 
ccggaagcca 
agctggagaa 
tctagccagc 
tttttcaggc 
ctctacccct 
gctttgtagg 
tgcccccagc 
gctgtgggga 
ggccacttgg 
gaagacaaga 
agagcggccg 
cctaagttga 
caaaatgggg 
cctgcccttc 
cctctccggc 
cagctgcctg 
ctagatttta 
acccagtagc 
cacctccttc 
tagtctgcat 
ccagaggccc 
ccctcagcct 
gtgtctgcag 
tttcctggga 
tgtttccttt 
tgctcattaa 
tgacactcag 
atgttggttg 
tggctcacat 
ctgagaagga 
gtttcccaag 
ttagtcactc 
aagagaggaa 
cagatggagc 
aaggctacat 
cagtcaagga 
aatatcatac 
gtgtgtgcta 
tgaggtgttt 
tttgctaggc 
actgagagtg 
ccagcatccc 
catcccaggc 
ccaggcccag 
tccaagcact 



tcttcctagg 

ccttgggagc 

ttgccatgca 

cacctcttct 

gcggaaggga 

ttctgcttcg 

acagcctccg 

agacagtgtg 

taccctggtg 

ctgctatccc 

cccttctgca 

gttcagctgc 

gcatgcaagc 

tgccaagctg 

taggtggcaa 

ctgtgcttct 

agctgggctc 

ctaaacccag 

ctaacgagga 

catttgcaca 

cacagcaggg 

tccctgatgc 

tccaaagggt 

aagtagaggc 

ctgcttcttg 

ctggccagct 

tggcggcctc 

cacttgtgcc' 

aagctggcca 

gcagcctcct 

atcctggctg 

cagtcattgg 

ccacatctcc 

ctcctccgtt 

ggaggaacac 

gaatgtatcc 

ctcctggatg 

aacaggaact 

cctccagagt 

tgcctctccc 

ccctcctgcc 

ggctgcttcc 

cattttactc 

atgtccgctg 

cagttctggg 

gaagccctag 

taattagtcc 

caagactcct 

cccaatacca 

gtatgggatt 

taaaaggggc 

gccagatatg 

tctgtgagtt 

agagaaaccc 

ggaaacacac 

tatggctctg 

gtggatattt 

atgtgtacac 

atttgcatgt 

acaacccagg 

aggcccagca 

ccagcacccc 

catcccaggc 

caatgcccag 



gaggcaagca 
cccgccccac 
ttttaacatt 
gccagactca 
gaagacccag 
ttactctcaa 
tcattccttc 
ggaagggctt 
ggtaccagga 
tgggctacca 
acctgccaga 
cttggctggg 
aagggagcta 
agcatgaaaa 
ggctaaggca 
caccccctct 
gggtggccag 
acctcacaag 
agatctgacg 
gaagaaaatg 
ggacaggtcc 
ctggagtagg 
caggacagct 
atccccatgg 
gccatgtagc 
gctcttctct 
aggccaggcc 
acttgtgttc 
gggaacagct 
atccatggct 
ctccttttaa 
aagctcagcc 
atctgtctaa 
cacaacttct 
ctcagaatca 
agtatctatc 
ccctcccccc 
agggtagtcc 
gcccatctaa 
tggctacaca 
tttgtccatg 
cagagaagcc 
tacagcacgc 
ccact'ttgtt 
ggagggaatt 
gctcaagcct 
tgggagactc 
taggcccatc 
gtcctttcct 
ttcagcacta 
caggaccaaa 
gtggggcaca 
tgaggccagc 
acttgacttg 
acacacacac 
tgcctgcagt 
gagctctgta 

aggtgttggc 

gtctgagttc 
gcccttttat 
ctccagaccc 
agacccagca 
ccagcacccc 
caccccgacc 



ctgtatcacc 
tgattgcctg 
gcaaattagc 
tcaagcgcct 
ttccactctc 
gcctccggcc 
ttccattcat 
gacaatgagc 
gagtataggg 
gtcagcattg 
gtttttgacc 
aggggaatct 
gcagtggcct 
gccacctcac 
gcggaatagc 
aatggtgcag 
agagcttggc 
ccagtttagt 
ctgggaatat 
aggtgcccag 
taccaagcta 
taaagcagcc 
gctgcagctt 
cccctgggca 
caatctctct 
gagcaatcga 
agctgcactc 
taaacacagg 
gggctcccat 
gtgagtagaa 
tgcttgcagg 
agtcctccag 
ctgaaaccat 
ccatcttggg 
ccgtggctca 
cacttatcat 
ccccccgcaa 
agggaaagtg 
ctagaagctg 
gaccttgctg 
ctgttccatc 
actcttgagc 
gacctcttgg 
ctgatgagtg 
accttctcaa 
ggggcctgga 
agagccttca 
aaggaagtga 
gggcaagggg 
aactggaata 
acctttcaaa 
tctttaatcc 
ctattctaca 
ccacccaaat 
acacacacac 
ccaggaatga 
tttatgtgca 
ctgttgcata 
atgcacataa 
cctgcagcac 
agcaccccag 
tcccaggccc 
gacccagcat 
cagcatccca 



159780 
159840 
159900 
159960 
160020 
160080 
160140 
160200 
160260 
160320 
160380 
160440 
160500 
160560 
160620 
160680 
160740 
160800 
160860 
160920 
160980 
161040 
161100 
161160 
161220 
161280 
161340 
161400 
161460 
161520 
161580 
161640 
161700 
161760 
161820 
161880 
161940 
162000 
162060 
162120 
162180 
162240 
162300 
162360 
162420 
162480 
162540 
162600 
162660 
162720 
162780 
162840 
162900 
162960 
163020 
163080 
163140 
163200 
163260 
163320 
163380 
163440 
163500 
163560 
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ggcccagcac 
cagcatccca 
atcccaggga 
cagggacagc 
tggaggcagc 
cctcattgct 
tcccagtacc 
gtctggggtt 
cagcacatcc 
tggagcctac 
tgggagtcat 
gtatgggaat 
ctgcacggga 
aactcaggat 
ccagggctaa 
tctgtcatcc 
ctacttctcc 
aatcttctgc 
tcccttttac 
tttctncncn 
nnnnnnnnnn 
aaattgaact 
ccttgtgatc 
tagtggttct 
aggcattgct 
tcttaagatg 
agcatgtgct 
acaaaaccaa 
attgggacac 
gaggatgaaa 
aaaatatata 
ttgactgtat 
cccaaaattc 
caggaatgct 
acacttccag 
atgtccttgg 
aagtatttat 
ctaggcctag 
cagcctttag 
tgattcagtc 
gtcttttctc 
tctaatctct 
ttctctcttc 
tgtgccgctc 
ctgtgctgca 
ctctcttctc 
tttgtctgcc 
tttaccttca 
tgagagtaaa 
tctccataaa 
ctgccatgtg 
aataagcatt 
cgactacatt 
agtgaatttt 
ttatattaag 
tttcttcata 
cttatggaaa 
cctcccatct 
tgaactcctc 
gagataaggg 
catgacccct 
ctagtgatgt 
agcctatgaa 
cgaccccaga 



cccaggccca 
ggcccagcat 
cagcacccca 
accccaggcc 
acatactgaa 
ttcagcacct 
aggattcccc 
ggcagcccca 
tgcccattag 
cctgtatagc 
ttcacctagg 
aggtaaggag 
gattgtgtca 
ggtccagaat 
gacccttcct 
ccatctgtgc 
cagcatccca 
taggtttgag 
caccaccact 
nnnnnnnnnn 
nnnnnnnnnn 
tacagaggaa 
agaaagctat 
ggggactgaa 
tatgctaagc 
cctataacca 
caaaaacaca 
acaacaacaa 
atttatacta 
agcccctgca 
gtgtatttat 
atctttgaaa 
ctgagttgga 
ctgatctaat 
gaggcagaaa 
agaggctacc 
ttgactgatg 
ccctggaagc 
gacttgctgc 
cccctgttcc 
ttggcctctg 
tgtctccttc 
catccttctc 
tactctccct 
ctctcttctc 
ctgagggttg 
actcaattag 
ttgtttcaaa 
gatgtgtgct 
taacagaatc 
gtgaagagaa 
ttcttgggag 
ttgccatgcc 
aaggcagcca 
aattgggaac 
tacaacttag 
ctgaagaata 
agagattgtt 
accctagagt 
cctcctaaaa 
tagttacgta 
aaacttgtac 
aaatttggct 
gctctggtct 



gcatccaagg 
cccagggaca 
ggcccagtat 
cagtatccca 
gatagggaat 
gctctcctca 
tcagctgagg 
tgctaattgg 
gcctaacctt 
tctgtattga 
caactccaga 
agcatagtga 
ctgtggttcc 
ccagagctct 
tggacggttt 
ctaacagctc 
tgcccgcacc 
ggccccaggt 
aatcctcttc 
nnnnnnnnnn 
nnnnnnnnnn 
acacaaatga 
agattaaacc 
cccaaggaag 
tgacaaaggc 
tgggtggagc 
cataaagatt 
aaagtactta 
ttctcgttct 
gaacaacgca 
gtaaaacata 
agaaccagca 
ggttcaatgc 
caatggattg 
caaggtgagg 
tcatccttga 
taactgtcat 
ttctagcttc 
tgagatcacc 
gggctcaaac 
aattgctctg 
acactctctt 
tgtaaagctc 
ctcagctcta 
cggtaccacc 
ggcagatcct 
acatcacttt 
ttaaaggtga 
aataaggctg 
ttagggttca 
tgcagagagt 
ttggattaga 
ttgagactgt 
aggctgtggt 
aaaaaagcag 
tttttatagt 
aaaattttta 
cccggaacac 
tcgaaccctc 
caacctcaaa 
gattcccttg 
tttccctgcc 
ggtcgtcgat 
atgttccatg 



cccagcatcc 
gcaccccagg 
cccagggaca 
gggacagcac 
gtctctgagg 
cactcggaat 
aatgggtagc 
actgacagtt 
gcctgcaggg 
ggcactcccc 
tgggcacaaa 
gcctggctgg 
agactgccaa 
taagagagca 
cttggctttc 
tctgtggaaa 
ccagctcact 
tgaggctgtg 
ctcctcttcc 
nnnnnnnnnn 
nnnnnnnnnn 
gctgtaatga 
ccagcccttt 
gacagctgct 
tttctgaatg 
aattctctaa 
gtttactgaa 
atagctcatc 
atcacacgac 
cacaatgaaa 
cataaacaca 
gagtgctatg 
agagttcata 
accaactgat 
cctaactgga 
tccttccctt 
cttggaggct 
catacaatct 
gtttcctgtt 
tcctctcccc 
cttggtctca 
gcttgttctg 
tcccggtaaa 
ctgcactgct 
tgtgtctccc 
atcctgtcaa 
caagcaggag 
gtactaaggg 
agccaactct 
caatacgatc 
taaggatgtg 
gtcattcctg 
ggagggctga 
tggactgtta 
agtagaaagg 
tagcttttta 
ctgaactctt 
tcctgaactc 
ccaactaaaa 
atgaaccggg 
gcagaacccc 
cagctctccc 
tctcctctac 
tgctttcttg 



cagggacagc 
cccagcatcc 
gcaccccagg 
cccaggcaca 
cctcttatct 
caaacaccct 
taccattttg 
tctcctgaga 
gtgtgctgta 
caagctatga 
aatctctcca 
gcacctgaga 
gacatcttgg 
gatgctgaga 
tactctgtcc 
acatgaggcg 
gtgtgccctc 
ggtttccctc 
tcctcttcct 
nnnnnnnnnn 
nnnnnnnnna 
cagaaaagga 
ttgcctttgt 
tggaatgttg 
catatggtag 
gagtctgtct 
gcagtatttg 
agtaggaaaa 
tattaagaag 
ttgaatagag 
taaaaagaat 
gtttataaat 
ctcttaggaa 
agattaataa 
ggaagtttgt 
tctccgcttc 
tactggctcc 
aatccaagcc 
ctttctgaac 
gatgatttta 
aactaactct 
tctttactgt 
cctgcctcct 
ctccacagct 
ttacgtagct 
acctttctct 
tgcctcctct 
tgtgtctctt 
agctagaaat 
aaatatcctg 
ctcttgaggt 
tgacactgtg 
gcgatggatt 
ctagctacgt 
acagttttgg 
tgttgctatg 
cttcacccca 
ttcaccccag 
actgttccaa 
tacattgcca 
ttgtcccttg 
cccttgagtt 
accactaggt 
ctgttgttct 



accccaggcc 
caggtccagc 
cccagtatcc 
gtatcccaca 
tggtccttac 
gtgcaggttc 
gcttttgtct 
gcaatttggg 
ggggcaggga 
cccatgccag 
ataagggtag 
cctgagcagc 
ctttcacccc 
ggcacttaac 
tctgtcccag 
tatgagctct 
atgttactca 
catctgtccc 
cctcctcctc 
nnnnnnnnnn 
caaaggacgc 
atactatttc 
ttttcatatt 
gagacctctc 
tatctattca 
gaacttctta 
caacagtgaa 
agaacaacaa 
attaaggtgt 
aataaagaaa 
agaaaactct 
gaaaagttcc 
gtaactaaat 
tctgaaggca 
caccagaggc 
ctggctgtaa 
atcagctaac 
tagaatgttc 
tctagctggc 
ttcacaatct 
agcaatcttt 
gtctagtttg 
cctccccctc 
ctcctgtatc 
tccctttcct 
gattcttcac 
acaaaccaac 
tttcagccag 
agtttctttt 
agacagctgg 
ttcagatggg 
acaaaggact 
aatacactgg 
ttagctggat 
cagaaggagg 
accaaaatac 
gaacccgacc 
aatgctttcc 
gaacattttt 
aataatagga 
acagaacccc 
ttactatata 
gcatgagttt 
attaaatctt 



163620 
163680 
163740 
163800 
163860 
163920 
163980 
164040 
164100 
164160 
164220 
164280 
164340 
164400" 
164460 
164520 
164580 
164640 
164700 
164760 
164820 
164880 
164940 
165000 
165060 
165120 
165180 
165240 
165300 
165360 
165420 
165480 
165540 
165600 
165660 
165720 
165780 
165840 
165900 
165960 
166020 
166080 
166140 
166200 
166260 
166320 
166380 
166440 
166500 
166560 
166620 
166680 
166740 
166800 
166860 
166920 
166980 
167040 
167100 
167160 
167220 
167280 
167340 
167400 
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gccttctaca 
tgagtgcttg 
acagcgcgac 
ggtctgatgt 
gctcagtgag 
tgtccaccgt 
cctggtggac 
tcccacctcg 
agaccgtggg 
ctcgtgcaga 
tgtggtcgac 
ctcttgttgt 
tgactctgga 
gaaagggtcc 
cacctgaggg 
aggaaggggg 
tcccacctcc 
cccagtctga 
ttgacgacct 
ctgcagctgc 
ctcggagtcg 
tgccactcag 
agtactggcc 
cagagaaccc 
cttgggatga 
tcctcataga 
acctcgtgga 
gtaggggacg 
gacggcccac 
cagtcttcct 
cagaggggca 
agaaaaagct 
aagcagaaaa 
aagaaataga 
ttttggccgc 
acagggcagt 
attgcaaaga 
tcctaaccct 
taactttgtc 
cagtactcac 
ctggtagtaa 
tgacccactc 
taaccaaact 
aggcccctgt 
aacccaaaaa 
aagcaggaat 
ccacccccat 
ctcatatcca 
caccacttct 
gggaggttaa 
taagctctct 
tttgcctgcg 
agggcggaca 
ccaccctgtt 
agcttaccct 
gtcaccaggg 
ctaaaaaggc 
gcaaaagatg 
ccccacggca 
gctttgcaac 
ggaaagaaga 
cgctagcatt 
tagccagggg 
caaaaaaatt 



ttttgagtac 
agtgagggtc 
cacccagagg 
ttgtgttctg 
accgcgctcc 
ccgttcaccc 
ccctttggag 
tgcccagttg 
ttcaagtccc 
gggtctcaat 
gcagagtcgc 
gtctactgtt 
gcattggaag 
gtggcagacc 
tgcttttgac 
tcaccctgat 
gtgggtcaag 
tgcagccgga 
cctctggata 
cccaccacag 
ccgaggccga 
agcacatgtg 
tttttcctct 
ctctggactt 
ttgtcagcag 
ggcgagaaaa 
cgaggctttc 
cctccttgtc 
caatttggct 
tgagcgtcta 
gagagccgct 
gcaaaggctg 
agtctatcat 
ggagagggaa 
agttgtgaat 
gaaaccgcaa 
gaaaggacac 
agaagatgat 
cgtggagggg 
taacccccta 
attttacccc 
ctttctggtg 
aaaggctcaa 
tgcctgcctt 
tgcagtctct 
ggggttggct 
ttcggtaaaa 
gaggttgctg 
gccggttcaa 
caaaagggtc 
cccacctgag 
tttgcaccct 
gactggtcaa 
tgacgaggcc 
actacagtat 
aactgagagg 
acaaatctgt 
gctcacagag 
ggtacgtgag 
cctagcagca 
acaccaaaga 
accagacttg 
agtattgaca 
agatcctgtt 



ggtctcagtg 
tcccttcggg 
tcctagaccc 
tttctaagtt 
gagagggaac 
tgggagacgt 
gccaagagac 
cgagatcgtg 
acctcgcgtt 
cggccggcct 
cgccgtttct 
tttctagaaa 
gaggtgcggg 
ttttgcgcct 
ttgtcactaa 
cagatcccct 
ccttggaccc 
aagtctggcc 
gactcccaac 
ggaccaatag 
agccccgggg 
agagggccag 
tctgatttat 
actgggctcc 
cttttgcagg 
aatgttctgg 
cccttgaacc 
tatcgccgga 
aaggtaagag 
atggaggcat 
gtagccatgg 
gaggggctcc 
aagagggaaa 
aatagacggg 
gatagacagt 
ggtggcaaaa 
tgggctagag 
tagggaagtc 
actcccgtca 
ggcaagctag 
tggacgacca 
atacctgagt 
gtccaattta 
gtcctcaaca 
tcaggttggc 
aaacaagtgc 
caatacccca 
ggccaaggag 
aaaccaggga 
ctggacattc 
agaacatggt 
aagagtcagc 
ctaacttgga 
ctccatcggg 
gtagatgatc 
ctcctcacag 
caaactgagg 
gcccggaaaa 
tttctgggga 
cctctatatc 
gcttttgagg 
actaagcctt 
caagcactgg 
gctagtggat 



tcttcttggg 
ggtctttcat 
acttagaggt 
tggtgcgatc 
gcggggtgga 
cccaggaaaa 
catttggggt 
ggttcgagtc 
tggtcacgag 
tagaaaggcc 
ggtttctttt 
tgggacaatc 
tcagagccca 
ccgagtggce 
tcgctgccgt 
acattgtgac 
caaactcttc 
catcagcacc 
ctccccctta 
cgagaggggc 
aggaaggggg 
caccaggacc 
ataattgaaa 
ttgagtcact 
ttctttttac 
gagaggacgg 
gccccaactg 
ctctagtggc 
aggtcttgca 
ataggagata 
ccttcattgg 
aagatcatac 
cagaagaaga 
atcgccgtca 
caggaaaagg 
agataccact 
attgccctaa 
ggggctcaga 
acttcctgat 
gctccaaaaa 
aacgagctct 
gccctgctcc 
cttcagaagg 
cagaaaaaga 
taactgcgtt 
ctccggttgt 
tgagcaagga 
ttttagtggc 
ccaatgacta 
accccacagt 
atacagtcct 
tcctgtttgc 
ctaggctacc 
atcttgcgcc 
tcttggtcgc 
aactgagtga 
taaccttcct 
agactgttat 
ctgctggctt 
ctttgactaa 
ctatcaagtc 
tcgtcctata 
gaccctgaaa 
ggcccacatg 



tccgcggctg 
tttggtgcat 
aaggttcttt 
gcagtttcgg 
taaggataga 
acaggggagg 
tgcgagatcg 
ccacctcgcg 
atcgtgggtt 
atctgattct 
ttgtcttagt 
tgtgtccact 
caaccagtcg 
aacgtttaga 
caggcgaatt 
ctggcagaat 
gaaactgacg 
ccccaagatc 
ccccctgccc 
tcagggaccg 
gccagactca 
taatgatctc 
aactaaccac 
tatgttctcc 
aacagaagaa 
cacacccact 
ggactacaac 
aggtctcaga 
ggggcagact 
cacccctttt 
tcagtccgct 
gctccaagat 
gaggcaggag 
ggagagaaat 
taaaataggg 
ggaaaaagac 
aaagcgggag 
ccccctccct 
agacaccgga 
gaccatggta 
tcagatagac 
cctcttgggg 
cccacaagta 
gtaccggttg 
ccccaatgtc 
ggtagaactt 
agctagaaaa 
ctgtcagtcc 
tcgcccggta 
cccgaacccg 
agacttaaaa 
ttttaaatgg 
acaggggttc 
ttttcgcgct 
ggcggcctcg 
cttggggtat 
ggggtatacc 
gatgatccca 
ttgtagactc 
ggaaggggtt 
gtctctaatg 
tgtggacgag 
aagacctgta 
tctgaaagct 



tcccggggct 
tggccgggaa 
gttctgtttt 
ttttgcggat 
cgtgtccagg 
accagggacg 
tgggtttgag 
ttttgttgcg 
cgagtcccac 
ttgagttgct 
ctcgtgtccg 
cccctttctc 
gtggaagtca 
gtaggctggc 
gtttttcagg 
ctcgtccaat 
gtcgcggttg 
tatccagaga 
caacagccac 
gcgggggaga 
acagttgcct 
attcctttac 
cctcccttct 
catcaaccca 
agaaaaagaa 
gccctcccta 
accgcggaag 
ggagccgcta 
gaaccaccct 
gaccccttgt 
cccgacatta 
ttagtaaaag 
agagagaaga 
ctgagtaaaa 
ctcctgggca 
caatgcgcct 
cgatccaagg 
gagcctaggg 
gcagaacatt 
attggagcca 
aaaaatatag 
cgcgatctgc 
agctggggaa 
catgaagaac 
tgggcagaac 
aaagctgatg 
ggcatccggc 
ccctggaata 
caagacctcc 
tacaatttat 
gatgccttct 
agggacccag 
aaaaattccc 
cgaaaccctc 
aaggagctgt 
cgagtttcgg 
ctccgagggg 
tcgccaacta 
tggattccag 
cctttcaagt 
actgccccca 
agagcgggtg 
gcctatttgt 
attgcagcag 



167460 
167520 
167580 
167640 
167700 
167760 
167820 
167880 
167940 
168000 
168060 
168120 
168180 
168240 
168300 
168360 
168420 
168480 
168540 
168600 
168660 
168720 
168780 
168840 
168900 
168960 
169020 
169080 
169140 
169200 
169260 
169320 
169380 
169440 
169500 
169560 
169620 
169680 
169740 
169800 
169860 
169920 
169980 
170040 
170100 
170160 
170220 
170280 
170340 
170400 
170460 
170520 
170580 
170640 
170700 
170760 
170820 
170880 
170940 
171000 
171060 
171120 
171180 
171240 
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tagccctgct 
cccctcatgc 
cgaatgacac 
atcctcaacc 
atggacatcc 
agagctccca 
ggagctgcgg 
tcggcacaaa 
atcgttaata 
atctacaggc 
attctggccc 
ggccacccaa 
aaaacaagtt 
tgaggatggg 
tttttgaaag 
accaattcac 
aagtccccaa 
tcactaatgc 
gagtgtattg 
tgttagtatt 
ctgcccagat 
aggtaatcgg 
ctcagttggg 
tagagaagat 
gaaaagactg 
actccttttg 
gtttccgact 
gtgattagga 
gtaccccacg 
cttgagccac 
gttgacggaa 
gaagaaaacc 
ctgcgccgca 
gtcctgggag 
cctctggatt 
caattgggac 
cccaaatggg 
agctgcacat 
cgggggggca 
ctggaagccc 
gtcaaacgaa 
cagagcaccc 
cgagaacgga 
atacattcca 
ggcagtgaca 
tggctccccc 
taccctccct 
taggaacaga 
ctaaccctaa 
aaggaatagc 
gaaaaaacag 
tacctcagga 
gtcagtatct 
gtgtgtctat 
ccagactctt 
gaaaaagaga 
gagtaggtac 
gcagttatgg 
ttaacttccc 
aaaaaaagag 
ggagtaatca 
agaaaaagcc 
ctcctctcca 
tgcatcctta 



gatcaaagat 
cttagaaagt 
actatcagag 
cagctaccct 
tcgctgaaga 
gttggtacac 
tggtagacgg 
aggctgaact 
tctacactga 
agcgagggct 
tgttagaagc 
aaaggagaaa 
gctcaagggg 
aggtataaca 
aaaaatagaa 
ccacctggga 
cctgaagtca 
aactaaagcc 
ggaggtagac 
tgtagacacc 
tgtggccaag 
gtccgacaat 
catcgattgg 
aaataggacc 
ggtggctctc 
aagttctgta 
ctgaccctgt 
cccagatttg 
ggttccgagt 
ggtggaaggg 
tcgcctcctg 
acgacgacaa 
ggcgccactc 
gtgcttaatg 
tggtggcctg 
ctctcagatc 
atagggagca 
ttttatgttt 
tcagattact 
tcctctaaat 
ggagaaagaa 
tcaggaccat 
aaacaacacc 
ctaagagatc 
ctcgtagggc 
aaaggtcccg 
aggaactaat 
ggaccgtctg 
tatgactcaa 
tcagatcagg 
aaagttgact 
taaagggcac 
ggtgcctccc 
gtctgttttt 
gtatcatgat 
acccgttacc 
aggaaccgct 
atattgatct 
tgtccgaagt 
gactctgtgc 
aagattctat 
aacaaagatg 
ctatagcagg 
ataagttagt 



gctgacaaac 
atcgtgcgac 
cctgctgcta 
tctccctcta 
aactgggacc 
ggacggcagc 
gaaaaaggta 
tatagcgctt 
cagccgatat 
attgacctcg 
catacatgca 
acttggtggc 
ccatgatctt 
taaaagagct 
ttaactcccg 
gttgaaaaaa 
gtggctcaaa 
tacagagaac 
tttactgaag 
ttttcaggat 
aagatccttg 
ggaccagcct 
aaattacact 
ttaaaagaga 
cttcctcttg 
tggaggacct 
cttaccctcc 
ggaccaactg 
tggagacaaa 
accctatttg 
gatccacgcc 
ttggacagtg 
tagacctagg 
aaaaggaaaa 
atctcacgcc 
atactgatct 
catatgggtg 
gccctggtca 
tttgtggtaa 
gggacctaat 
acccctataa 
gcaaagataa 
gtctaagttg 
ctgggttcat 
ccaacaaggt 
actgtaccag 
actctcctca 
gtcagtctag 
tcatgctggt 
acttataata 
ctagcagcag 
ctctgtaatc 
ctagacacag 
aatagttcca 
aatagttctt 
ttaactttgg 
gccttaatta 
tagaactata 
ggtgctgcaa 
tgccttaaaa 
ggctaaactt 
gtttgaaagc 
acctttaatt 
agcttttatt 



tgacaatggg 
agccacctga 
aatgagcgtg 
acaaatgatt 
agaagtgacc 
agtttcctga 
atttgggcaa 
atacaagccc 
gcttttgcta 
gctggtaaag 
cctaaaaagg 
caagggcaac 
aactgaaaaa 
atggtagacc 
aagaaggaat 
tgatgagact 
agattataga 
ctggaaagag 
ttaaacctga 
gggttgaggc 
aagaaatcct 
ttgttgccca 
gtgcttaccg 
ccttgactaa 
cgctcaaaca 
ccccccttaa 
tctttgctta 
aaagcagcct 
gtcttggtca 
gtgttactga 
tcccacgtca 
gcagtcactg 
gaaccttaac 
cattgtatgg 
tgacatctgt 
tagcaaccca 
ttcggggcag 
gggtcagagc 
atggacatgt 
cacggtaaaa 
atatcaagag 
atactgtaac 
gcttaaagga 
tttcacgatc 
ccttataaaa 
ctccaccaac 
taaagcctac 
tccaaggagc 
tatgctatgc 
ctacttcaga 
tttcaggaag 
agacccagaa 
tgtgggcttg 
aagatttctg 
ttttagataa 
cagttctatt 
agaccccccc 
gaacagtcta 
aatagaaggg 
gaagaatgtt 
agagaacgcc 
tggtttaata 
acacttatgc 
agaaaaagga 



acagcaggtg 
cagataagat 
taacctttgc 
ccgtcccagt 
tgactgacca 
tagaggggaa 
gcgctttgcc 
tccgagaggc 
ccgcacaca.t 
acattaaaaa 
tagccatcat 
cgaatggcag 
ggtgatccgc 
agtgatcccc 
aaaatttgta 
aattaaaaat 
ctcctgcaaa 
acaacgggga 
aatgtatggt 
atttcccact 
gccaagattt 
ggtaagtcag 
ccctcaaagc 
attagccatt 
cccctggtcg 
tggaagctgg 
ttcatttaaa 
ataccccagg 
gacggcatcg 
caacccctac 
agagggccgc 
acaatcctct 
cctcatgctc 
gcaaccactg 
aagttagcgg 
ccccctgagg 
ttctaccgag 
aaaaggcttc 
gaaacgacag 
cgaggtagtg 
agtgggtgcg 
cccctacgta 
aataggtggg 
agattgacag 
caggggcccc 
tccacagccc 
cttggcttcc 
ttttttagtt 
ctctagcccc 
tcattctcaa 
agggctttgt 
catccagtct 
caataccggt 
cattttggtt 
atttgaacat 
aggattggga 
aatactatga 
taaccaaatt 
aattagactt 
gtttttatgt 
tagatatacg 
agtccccttg 
ttttgcttac 
taaacgcagt 



accgttgtag 
gacaaatgcc 
gccccctgcc 
acatcaatgt 
accctggcct 
gcaaaaggct 
tgaaggaaca 
taaaggtaag 
ccatggggcc 
caaagaaaaa 
ccactgcccc 
acttagtggc 
ccaaaagccc 
tcccatactt 
aaaggactac 
tcccgatacc 
ccatgtgcat 
gaccatcctg 
aacaagtatc 
aaaacagaga 
gaaatcccta 
ggcttggcca 
tcaggacagg 
gagaccggca 
tttcgggctc 
tggaacatta 
ggccctaaaa 
gaccaccgca 
aaccggtagc 
tgcggtaaaa 
cagtcaagat 
taagcttcgt 
caattcaaca 
cagtccatcc 
caggatcccc 
agcggtgtgt 
ctaatcttag 
aacaaaaatg 
gagatgctta 
gctatgataa 
cttttaaaaa 
taaggttcac 
gttggcgagt 
tgagagaccc 
ccagtcgtac 
aacacagtgg 
ccaccgcccc 
ctaaatagaa 
ccttattata 
tgcctttggg 
ctgggccggg 
agcaaaagcg 
ctcactcctt 
cagcttattc 
cgggtccgct 
gtagcagctg 
agaactacgt 
agaagaatct 
attattcctt 
tgaccattca 
taaaagagaa 
gctcaccact 
ttttgggccc 
ccaggttatg 



171300 
171360 
171420 
171480 
171540 
171600 
171660 
171720 
171780 
171840 
171900 
171960 
172020 
172080 
172140 
172200 
172260 
172320 
172380 
172440 
172500 
172560 
172620 
172680 
172740 
172800 
172860 
172920 
172980 
173040 
173100 
173160 
173220 
173280 
173340 
173400 
173460 
173520 
173580 
173640 
173700 
173760 
173820 
173880 
173940 
174000 
174060 
174120 
174180 
174240 
174300 
174360 
174420 
174480 
174540 
174600 
174660 
174720 
174780 
174840 
174900 
174960 
175020 
175080 
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gtactaaggc 
tatctcctaa 
cagaacccga 
agaatgcatt 
tagaacattt 
caaataatag 
tgacagaacc 
ttttactata 
gtgcatgagt 
ctattaaatc 
tgtcccgggg 
cagaggaaaa 
agacaagatg 
tgtgtggtgg 
tacctctgtc 
cagataggaa 
ctcaatagga 
gctggccagg 
ctctgtgagt 
aggagcctgt 
tttaatcccc 
attggagcta 
agcttatctc 
cctccacaca 
cagctgcaca 
tattagtttg 
ccttaaaaca 
aagagtactt 
agcaatagcc 
atgagcacca 
ttcagccttc 
aaggctaaac 
ttgtagttat 
ccagtcacct 
ataatggtat 
agccatcgca 
attataaatc 
ccatcctaga 
ggacattgag 
tgttggacta 
gggtgtgtgg 
atttcgaggc 
tgtttcaaca 
taatacatgt 
ttctgctagc 
ggtcctcctg 
ggttttggtc 
catctggcaa 
tgtgagattt 
ttttcattcc 
tgtcaatcca 
ctagggtggt 
agaaaatggc 
gaccttgctg 
ccttaggagt 
accaaatgtg 
aaaatagatt 
aactggacaa 
cataatgagt 
ctcatgtagt 
tgcacgctgg 
acagaatagc 
gaggaagagt 
gttaattgac 



aacaatatcg 
aagaagtggg 
cccctcccat 
cctgaactcc 
ttgagataag 
gacatgaccc 
ccctagtgat 
taagcctgta 
ttcgacccca 
ttgccttcta 
cttgagtgct 
atgtattctg 
gaacagctca 
acaggaaaca 
cgtcaggccc 
ccaagtatca 
gtgtgttcca 
agagctgcac 
tcaaggcaag 
ttcaaaacaa 
atgacctcaa 
gtgagatggc, 
tggggacaca 
tatgcctccc 
tggtcatgca 
agatcgcata 
caaaaatcaa 
tgtagtaagg 
ttccgagtgc 
gggtacactg 
ctaactgctc 
tactgctcag 
gcagaattca 
cttaaaggtt 
tccaccctaa 
ccagcccatg 
tgttggtgag 
ctgaagacca 
cctcatggac 
cccagacaat 
cacacacctc 
ctggtctata 
aaaccaaaac 
gggtaccagg 
ctagtgaagc 
agctttcctt 
ttaagaggtt 
atgtcttgac 
tgctcctcac 
tgagaactgg 
tcacaccaga 
cactttgctg 
taacgactgc 
cagtctgaag 
attgcttgct 
aagaaatatg 
tattacagag 
atgttggtac 
tagtttgctt 
ctatggtctc 
caatgtaagc 
tctgtgatga 
ctcagtgact 
atggaaagat 



ggtccttcag 
gaatgaagaa 
ctagagattg 
tcaccctaga 
ggcctcctaa 
cttagttacg 
gtaaacttgt 
aaaaatttgg 
gagctctggt 
cattttgagt 
tgagtgaggg 
cctcatgggt 
acctgctggc 
gagagctgcc 
tacatcttaa 
aaccaccagt 
ggatgtggac 
acctttaatt 
cctgggctat 
acaaaaccaa 
atagccaaac 
tcaacatatg 
cataatggga 
acaaataaga 
tgcctataat 
gcaagcagta 
ttaagcaaca 
acaataccaa 
tcagggatgt 
ctctcagagc 
aggcattcag 
catggctgtg 
agagttatag 
ggatgcacca 
tccccaaaga 
gccttgaact 
gtattccttt 
gcagctctcc 
tcagccgcta 
actatgtaag 
tactcccagc 
gagtgaattc 
cgtaaattca 
ggttgaactc 
tgaggcaggt 
tgagagccta 
ctggacttgg 
aaaggagatc 
cactgcagga 
ggaaaactga 
cctaaccaac 
ggctcatcct 
taagaagcag 
tctttaaaat 
gtcacctatc 
gcatcaatac 
tgcaagggaa 
agttcatgaa 
ttttttttct 
aaactcacca 
atgtgccacc 
ttaacttcaa 
gggtgggcta 
caagtccatt 



gaggttgaaa 
taaaaatttt 
ttcccggaac 
gttcgaaccc 
aacaaccgca 
tagattccct 
actttccctg 
ctggtcgtcg 
ctatgttcca 
acggtctcag 
tctcccttcg 
tcagggggtt 
aggagggtgt 
tacagtctta 
aggatctaca 
ttgtagggga 
aaggagaaca 
ccatcactcg 
acaattctag 
accataaaaa 
agctctcctc 
aaagtccttc 
gaggaccaat 
aaatatatat 
ccagcactcc 
ggctagacag 
ataacagtaa 
aaatgttcct' 
ataaatactt 
tgcccagaaa 
aggtcacttc 
gtacttggca 
catcatgaaa 
agtgcctttg 
cttctggcca 
cttgatggtc 
gctgataata 
aggagtcctc 
ctagattcgc 
ccaatcccat 
acgcaagagg 
caggccagcc 
ttctatcagc 
aaagtcttca 
acagccggtt 
caaagaactc 
atctgtagct 
acttgtgttt 
ctgcagatct 
ctttgtattt 
tgccaagagt 
catccttggc 
tctttccttc 
gtgtttgtta 
atattctatc 
ccactgcaaa 
aagaaaaaaa 
gaggttctac 
ctttccttcc 
ctgtaaatca 
aggcctggct 
tcatcaactt 
agggcatgct 
gtgagcagca 



actcgctcta 
tactgaactc 
actcctgaac 
tcccaactaa 
aaatgaaccg 
tggcagaacc 
cccagctctc 
attctcctct 
tgtgctttgt 
tgtcttcttg 
ggggtctttc 
tccctcagca 
gggaaaggac 
caggcctacc 
gtttattaaa 
taaaaataca 
cagttgttta 
taagagggaa 
attagccaga 
agcatttctg 
agtccaaacc 
ccaaaaatat 
ttctacaagt 
aataaaaaga. 
agaggctgag 
ggctacatag 
caaccacaac 
ttaaggacag 
agaaaacttc 
gttgtttatc 
tgtagtagcc 
ttatcatttt 
gtttccacca 
gggtgataaa 
tctcataatg 
ctgtctcagc 
caagcaaatt 
aatgcagact 
aacctattca 
tttaatacac 
cagaggcagg 
agggctacac 
tctatttcct 
tgtttacgta 
ctttactgct 
tttttttctt 
gtcatatcac 
gctgcagtgt 
attctgcctt 
gggctttgaa 
tctgctgact 
ctgcagttta 
cagaaatttt 
tggtagaata 
aggaagcaga 
aagtgtaaat 
tcagccagtt 
aaaatggctg 
ctttcctttc 
ccttgaactt 
cacacatttg 
gacataacca 
cataagggat 
acacgccctg 



agattagagc 
ttcttcaccc 
tcttcacccc 
aaactgttcc 
ggtacattgc 
ccttgtcccc 
cccccttgag 
acaccactag 
tgctgttgtt 
ggtccgcggc 
aaaactactt 
aattcaggga 
aagtgttcat 
accactgacc 
agaacactac 
aggaacacat 
aaagcttaac 
gcaggttcat 
gctacatcgt 
aggctttggg 
aaactgcaaa 
tgacaactgt 
taccctctga 
aagaagtcta 
gcaggaggat 
tgtaaacctg 
aaaaacccaa 
ttctggaatc 
ccctggagaa 
ctggattcat 
aatgtctaaa 
gtgactggtt 
agttcctgat 
ttatattcaa 
taaaatgctg 
ctgtgtttgg 
cttcaagctt 
ggcccagctg 
gacaagccac 
atattcatct 
cagatctctg 
agagaaaacc 
tagagaattc 
gcaagtttcc 
ccttgcaaat 
taggtctcca 
agacattcaa 
cccttctggc 
tttagttgac 
tttgtccatt 
tttgttttct 
tccccaggaa 
agtctatcta 
ttttgagttg 
cgtcccattt 
aaataataaa 
tcagaattgt 
ggggtgggaa 
cttacaaggt 
ctgatccttc 
gtttttcaat 
agaatcgtct 
tatcctgatt 
aacagaagtc 
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ttctgaagta 
cgtgtttatt 
aacatgaact 
tatgctaaga 
agctttggaa 
atctgatttt 
agactcagaa 
tgtcattgtt 
ccaggctaga 
tttgagacag 
ctggcctcga 
gtgccaccac 
tgttcctgcc 
tagtaaacca 
cccaaccatt 
tttttcagct 
ggttctgatt 
tgcggtaagc 
ccaggtttgt 
cattggggca 
gatttggcaa 
caacacaaca 
aagagcaccg 
tcacaaccat 
tgtactcaca 
atgtacctca 
cattttctta 
cttgacacag 
ctctggtggc 
agaagcaaag 
tacatgcatg 
tcattaaaaa 
gctttgaaag 
aaccttttga 
taatggcagc 
tagtgttttt 
acctttgtgt 
ggcttacaca 
ccagctcact 
gggttcatgg 
ccatccttag 
ttagtttgat 
aacgagtttt 
agttatgatc 
tattgcctga 
ttgactggtt 
cctccagact 
tgtgatcatg 

gggtggaaag 

agaaatgtca 
tttatatagg 
aactctaagc 
acccttcaag 
tacgcacaag 
acacaactta 
acatgggaag 
nnnnnnnnnn 
nagagagaga 
gctgagtcag 
cgctgtccca 
gctgcctcat 
cagctgggcc 
gtgctttttt 
aatccttgca 



taagaggaga 
ctctgtctgt 
gcaccctgga 
tattttatcg 
ctgaacctaa 
gatggtaaaa 
aagaagttag 
ctgaatgtaa 
atctgaagga 
ggtttctctg 
actaagaaat 
accaggctag 
aatgtaccaa 
gaataactta 
tttggaattc 
tttttcaagt 
agcatttcaa 
agccacacac 
ttctctatcc 
cgtgcaactc 
tgtcaccctc 
cctctcatct 
actgctcttc 
ctgtaatggg 
tatattaaat 
gagtccaaat 
ttgaggttac 
cagacaagca 
acttcctgcc 
gacctactct 
cacatggtac 
gaaaaatcag 
ccaagaagtg 
caaaactaaa 
agtgggatta 
gtgtttattc 
gggtcctaga 
gtaggtggtc 
ggccttcttt 
ccagtgctct 
ctgtggtcct 
catctctctt 
ctacttctgt 
ctcacagtcc 
ccctgctcgg 
ttgttccaat 
gcagggatta 
tggtgctcag 
attccaagaa 
gggacatttc 
ttaacctgaa 
aactaaggaa 
ccccacccaa 
taacatcata 
aaaagagagg 
gttggaggca 
nnnnnnnnnn 
gcctattatg 
aactagccta 
gatttgcctt 
gctaatcagt 
agagccacta 
ttttttttta 
ataggctgca 



aagcttgatg 
tcttgaccgt 
attgcaaggc 
cagcaataga 
ggctgttata 
tcatgtgttt 
gacttgaatt 
gtgcctcagc 
actcctctat 
tgtagccctg 
ctgcctgcct 
gaactcgtct 
atagtcaatt 
tctagccaaa 
tgtgagggga 
tgcaggtcat 
aatgaggcaa 
tcttgcttgg 
taggcctctc 
aaacacggag 
ctcccttctc 
caagaattgc 
tgaaggttct 
atctgattac 
aaataaatct 
gcttcttcct 
tgcttctctg 
gttctttaaa 
atgactgatg 
ccaaagccgt 
acacacatac 
ggctcagcaa 
gatattaatt 
aaatataaca 
gtttcttttt 
atattccaca 
gattgaactc 
aggcttgtat 
gtttaaaaaa 
ttatcttaaa 
aggagaaaaa 
tgaagatata 
tcaacaaact 
ggtgggtcag 
tgcatccaca 
ttttagtata 
atgttcttga 
cactcttgca 
ccagaggaag 
tagacctctg 
aggggaaaaa 
tgttgggggg 
gctggttatc 
tggatgggca 
ccatgaattt 
gnnnnnnnnn 
nnnnnnnnnn 
tcgttggttg 
gctgagacac 

tgggggccct 

tgaggttgtg 
aggggagtct 
aatgttccct 
agacaggcaa 



agagcaagca 
agatgtgatg 
aaacaaacct 
aatgaaactt 
attcactagg 
gtttctggat 
ccgttttaga 
tgaaaataca 
tttttgtttg 
actgtcctgg 
ctgcttccca 
attacacatt 
gattcctgtt 
gtctgcctat 
atccacagat 
gattcagtgg 
gcagagggca 
aggcttagtc 
tcttaaattc 
tgtgactcct 
cctagggtca 
cattagggct 
gagttcaaat 
ctcttctggt 
aaaaaaaaaa 
cccctgacta 
cttctaccct 
gcagggctca 
atctaaggtt 
cctctgacca 
acagaagtaa 
actttccgtg 
atagttattc 
gagtgtattt 
tagattattg 
gcatgtgtgt 
aagtcatcaa 
ctttggaagg 
agaaaaaaaa 
atcaactgca 
catacagttg 
atcagctcac 
acccaagctg 
aagcctaagc 
ttcacatcct 
tgtgctgctg 
ctgccacaga 
cccaaggctc 
aagaaagtca 
aagtctcaag 
attcttacag 
ggggtagtct 
caaaacaaac 
gattgcattt 
aagagagagc 
nnnnnnnnnn 
nnnnnnnnnn 
cttctaatca 
tgtccaccca 
tgaaatgaaa 
aaaatagccc 
ggtcctttgg 
gctttaggct 
gaatatcatc 



ggcaagcaag 
gctgtcttgg 
tttcctcttc 
agaacaggcc 
atagggacca 
atgatagatt 
gacattccag 
aagagtacag 
cttgtttgtt 
aactcacttt 
agtgctggga 
aacacccctc 
tatttaccac 
tagccatatt 
gctgtagacc 
gtcatgaaat 
tattgtcaca 
agtttctggc 
caaacatagt 
ttccccatct 
gttttacctc 
ggtgagatgg 
cccagcaacc 
gtgtctgaag 
aaaaaaagaa 
cactcacgct 
ggctccttct 
ggaccagtga 
aagcctagaa 
ccatgtgtaa 
aaagagattt 
tagaaaacta 
attatagcag 
tttttgtaat 
ttattatttt 
ggaattggat 
gcttgcacag 
caagcatttt 
agtccttttt 
aacttttatc 
gatggcttta 
atcacactca 
agcagctcca 
gggcgtggct 
ttcctggtga 
aaacaatctt 
gcactaatat 
ggggaacatg 
gaggtgagac 
aacaaggcct 
gggtccaacg 
tccccaggga 
tggtcagtcc 
aggaatacac 
aaagcaaagt 
nnnnnnnnnn 
nnnnnnnnnn 
ttagaaaacc 
ctgtcccaga 
gtcaccagca 
tgcagtggtt 
agcagagtta 
cagtgctgtg 
cctgttttgc 



ccaggatcca 
cttcctggga 
caagttgctt 
cataactgcc 
ctggaagtga 
tatcaatttg 
agaaaactga 
ggaagaaagc 
tggttggttt 
gtagaccagg 
ttaaaggcgt 
tttaattaac 
atgtttctgt 
ttcatcagtt 
gctttagaca 
taatttagtg 
gcacagcaca 
tctaaacgcc 
tagacattac 
gcggttccca 
tcacactcca 
ctcagaggtt 
acatggtggc 
acagctacag 
ttgccattaa 
ggcctgagtc 
gctgcctatc 
gactgatcgg 
cccacgaggt 
actgcacatg 
aaattgaaaa 
gagtacttag 
agatttctaa 
gtaagtttac 
tattaattat 
ttctgcttcc 
taggtggtca 
acttcctgtg 
tgtttaatta 
tggtaaaaag 
tcctgcaggc 
agcctctgcc 
aacaacagcc 
acctcgctgc 
gtgtggttct 
tttgcctctg 
ttactgaaca 
gaggaagagg 
tgcatctcct 
gaaagtctta 
ttagacaaag 
acactcctct 
tgaagccata 
acatacacac 
gggaaggggt 
nnnnnnnnnn 
nnnnnnnnnn 
actctcttag 
gcaaggccat 
ggctccggaa 
cctgggcctg 
acagtcatca 
cgctacttct 
cctcaggcaa 



178980 
179040 
179100 
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attctgaagt 
attctcactc 
ccacctccca 
gagaatgaag 
tgaacataaa 
gagaaatgtc 
ccccaaacag 
actcatgttc 
gcaccaggag 
tcccacagtg 
cctgggcttg 
gggacgtaga 
aatgccagca 
acagactgag 
agatagtggg 
acaaaaatgg 
tttgtcacct 
gttcccagca 
ctacaaagta 
caatcaatcc 
cttccagagg 
ctccaatccc 
agcatacaat 
gtacagaaag 
tttcctgtct 
tggtgtagtc 
aacaggatca 
ccccggctgt 
caactgagga 
tctggaattt 
gcagaactgc 
gggtctgtgc 
gaaaaaaata 
gaggcaggac 
cctctgatca 
tccattagag 
agtatggtag 
agaagttcag 
gttttgtctt 
tttgctgcca 
atgacgcttg 
actaagtgga 
tcgttgggaa 
gaaagagcca 
ccaaagctgc 
cacaggaggc 
atgaaacgtc 
gggctggcag 
tgcttttcca 
gctggacttt 
ctggctctgc 
gagaggtttg 
ctcaaaggca 
ccaagcccta 
agggggctgg 
attccgttcc 
gaatctgttg 
cgcaggcaaa 
tcatgccagc 
cagcctgagc 
aaaacaaaaa 
caatggatcc 
ttctccatag 
agagttgagg 



ctggcaaatg 
tacttctgcc 
tgctcagggg 
tagaagccag 
gttcagttac 
aagtgctaca 
atacaaacct 
cagcccagtt 
gaaatataat 
actgtggata 
agatctagca 
ggcagaagga 
tttgagaggc 
ttccaagaca 
agagaattca 
agcatgaatg 
acataccaat 
tctgggaggt 
agttctagga 
atcagtcaat 
acctgggttt 
aagggacatg 
atatatatgg 
agatcaacta 
gacatctgac 
cctgtgaccc 
aatcccacag 
ctgtgcagca 
gtgcctcagt 
gtcatttgat 
agaggaaaag 
tttctcttac 
gatgcatcct 
tcatcatgca 
aatcagtttc 
gcctgcatgc 
tatctgttta 
agttatcctt 
aacaaaacga 
agtttgatga 
aactccacgc 
taaatgttaa 
gggaggggaa 
gtgtgccacc 
ccagtgtgcc 
cagcaggagc 
tttccctttc 
ggctcatcca 
aggtcttagt 
actctgctgt 
caaccaccat 
gtctggtcct 
agtgctggtc 
gaggacagac 
agaaatagct 
taacattttc 
ccctctctgg 
acactcatac 
gatgctctca 
tatagaatga 
caaatctgtg 
ctagagctct 
accacatttc 
tttacatggt 



aaagatgtgg 
tgtgccacct 
ctaggaagtg 
cttcaaatct 
atttggttta 
attcagtgga 
tcatcaaagt 
ggctccttct 
agatgtcctt 
cagcccagtt 
ccctaaggca 
tcagttcaag 
agaaacagat 
gccaaggcta 
aggttatctt 
cttgcaactg 
tatacaaatc 
agagatgggc 
cagccaaccc 
catgggctgg 
gattcccagc 
acaacttcta 
gtaaaatgct 
taatgaaaca 
ttcttgcgtg 
aggattaggc 
tgctccactg 
ccgcccagat 
gctcctcagg 
atttccagac 
gggcactact 
aaatcattgc 
cacagtacag 
gcctctttcc 
aggcctggaa 
acgaagctct 
gcatggtgtg 
gcacttacag 
aacaagaggg 
cctgagtttg 
acatgccaca 
aaataaataa 
aaggcaggcg 
aaagctgccc 
accaaagctg 
agccaggttc 
ttcagaccac 
ctctagtgtg 
gtctaattaa 
aaggactttg 
gtcagtgggt 
tgtctaatgc 
cagcatgtgt 
atgctttaaa 
cagcagttaa 
atggtggctc 
ttttctcagg 
atatgaaaca 
gcactcagat 
gatgctgtct 
ggcttaatca 
gctttgcccc 
tattttgggg 
catttttttt 



gatttgaaca 
tcctctcatg 
gggagaagat 
aaactaagca 
aaaaaaaaaa 
tgtggatgaa 
ctcttccaaa 
catgtgagct 
tttaaggggg 
agtagagttc 
tagtggtaca 
gtcagatgag 
gaatttgtga 
cacagagaaa 
ggactgcata 
tggacaatat 
agattcagct 
agagctctat 
tacagagaga 
agagatagct 
acccacatgg 
ctggtctctt 
atatatataa 
accataacac 
acccagtgcc 
aaccactgcc 
taccccgcca 
gcttggtgtc 
gcagtgtgca 
tccagctgac 
gtgactgtta 
ggcctctctt 
gatggcaggt 
ctcaactaca 
agaaacggcc 
ggatttgatc 
gaggtatacc 
taagttcaaa 
gccggggaga 
gtcccttgag 
gcccacgcat 
atcaaaggaa 
agggagatag 
agtgtgccac 
cccagacttg 
cacctcagag 
agcagtgaca 
cctgtggcca 
ggttagcagc 
ggcattgttc 
cttcagaggt 
aaaatgcctg 
ggaatcctaa 
aaaaagtcat 
gagcactagc 
aaagctgtct 
caccaagaac 
ttttttataa 
ggcagaggca 
caaaaagaaa 
ttcctagcca 
ctggtctgga 
accactctgg 
gttgttacag 



cagacttgtt 
cacggggagg 
ggatgtcctc 
atgttttatt 
tccctacaca 
acaatcaaaa 
ggctgggtct 
ccgacttcca 
gtggggtctg 
tcgcctagca 
tgcccatgac 

gggtggggag 

gttcaaggcc 
ccctgtcttg 
agactttgat 
tgggttcata 
gggcccactg 
gactttaagg 
tacactactt 
cagtgatcaa 
cagctcacag 
tggtcaacag 
aaatcagatt 
atactgtttt 
tccacaatga 
cctgggaagc 
gaggacgtga 
actgagtgac 
attgaaactt 
cttggataac 
tcccctgcat 
cacttccctc 
acgaggcggg 
ccgccgcccc 
actcaggctg 
ttcagcacgg 
tatctctgca 
gctagcttgg 
tgcttcgctg 
cttatggtgg 
gaatgtgtat 
tggccactca 
ataaccctga 
caaagctgcc 
attacagatt 
gtggagccac 
gctgtcctgc 
gaacaggcct 
caaattggag 
cattccgtga 
agaagaactc 
gggcaccagg 
gttcaatacc 
gctttaaaaa 
tgctctttca 
ataattcaag 
acatgtggtg 
acctgctcgg 
gggggatttt 
aaaaaaacaa 
gaaggagctg 
gagcttgctc 
gcctgcacat 
taatgagagc 



tggccaaaga 
ggaggggagc 
aaagcagggt 
tccatttccc 
actggttctt 
tgttgaacac 
gaaaagagcg 
aagactgctt 
tctgacaacc 
agcgtgcggc 
cacagcactt 
gcattccttt 
agcctggtct 
tcaggaaaaa 
tccaaaataa 
catattctgt 
gtgcatgttt 
ctagcctggt 
ctaaatcaat 
tagtgccatt 
atgtctgtaa 
gcatgcacgc 
cacaaatcaa 
aaaagctacc 
gacaagcaga 
cattcctagt 
agcctccctt 
catcggagcc 
gcatgtgatt 
agcacagagg 
gattatagaa 
ctttgaagtc 
ttccgggact 
tgcagagttc 
gggatgtggt 
gcataagccc 
cagggagacc 
gcaacatgaa 
tgctgagtca 
aaggagagaa 
acacacacac 
aaatctacca 
tatgaacacg 
cagtgtgcca 
tggccaggga 
aaacctggaa 
agagtctgga 
cagtcacagg 
agagaagggt 
tcaaatacca 
atcctttttt 
ttaatgtcaa 
catcagagcc 
aattctgttg 
gaggacccag 
tcctagaggg 
caaacataca 
atgtggtggc 
gtgttgagcc 
aaaaaaacaa 
gctccagcaa 
tagaaaggaa 
ggaaaaatgg 
tttgaaatga 
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tcacaaaagg 
aaccgaatcc 
ttatggcaac 
tgcaaagtct 
gtcataaata 
ctaatgagga 
ttctaatata 
taaaatgata 
atccaaagca 
tctcatccgc 
taggacagac 
tattaggctt 
aaaaagcaac 
ttgcataatg 
gaacccggca 
ttgcagggag 
gacctgactt 
ggattgctct 
agcatggatc 
tgcataccag 
aaccacccaa 
ggattcataa 
tacccaaaat 
tttggaaaac 
gcaccaagac 
gccgctcacc 
cacagtcgca 
tttgtgaaac 
gtaattcaaa 
tgcctactga 
taacttccag 
gacccccatc 
agggtcttgt 
gcattattta 
catttcctct 
tatccttgat 
tttctgggtt 
accaacatga 
acaggcatgc 
cagagtaatt 
aatgagttct 
gggtgatgtg 
tggggagctg 
ggtacactcc 
catggcgggt 
aagtttgaca 
ccacaccacc 
caacaagaga 
ttgtgaggaa 
ttagtcggtg 
gaacaccagc 
ccgaattcca 
tggcgctcag 
tatcccctgt 
ctcccctcct 
gaccccctcc 
tttatcaagt 
gtttctattt 
actgagggtc 
accacgcacc 
ctcgcccaaa 
aaaagctacc 
acagggcggg 
tcaatattta 



aaaataggaa 
taaaaggaat 
tgacatttaa 
tgcaggaact 
tagcttattc 
aatatttcca 
atttatcaat 
aatagcaata 
gtgtagctat 
agatacagtc 
gaaggggggc 
tgttataaga 
tacttggggt 
aacaattttc 
caggctcata 
agaaagcagt 
taaacagttt 
caaagactct 
tccggccctt 
gaggactgaa 
actcatttcc 
aataaggcag 
ggaactgcat 
tggcacagct 
ccagggctga 
aatcccggct 
ctatgctctc 
tgttttcgaa 
actaattacc 
ttgtgcgctg 
gctaaatcag 
cctgacttcc 
agttctggct 
atttaggcta 
tcttcaattc 
cgtcccaccc 
tgcaaggtgg 
gcggcgcaca 
acacaccggc 
aggcctaggc 
cttcctttgg 
gctgaaacca 
cagtcggaag 
cagaaataga 
ttttagacat 
gagtgatatt 
gccacccaag 
gggagccggt 
ttgagtgtac 
aagcaggggt 
tcgaggaaca 
cagggcttaa 
caatctcatt 
atctgactca 
cctgcctatg 
actggactct 
gttctgttgg 
cctttacatt 
ctggtagttt 
aagcaaaaat 
gccagattta 
acttggcaga 
atggggcggg 
acctaactgg 



aatatgcctc 
gttcaagtaa 
aagtaacagg 
tcatgctaaa 
ccaaatcctt 
caaaaaccca 
ttaaacacat 
gtggtgaggg 
atttaaatac 
tagactcaga 
aataagaaat 
ttacatcaaa 
ttgcgcctgg 
attcaatcag 
gtttttatct 
cgggaggcat 
ctctctggag 
cggactcctg 
ctcactgctt 
gtttggctcc 
gacttggttg 
aggcagagaa 
tttcatgcac 
gacattggct 
ggcctaatcc 
ggagccgcag 
ccgtggggag 
aacctgggat 
ctgagcacat 
cagctcactc 
gcagccacag 
aggcgaggag 
gccgaattat 
gaatgacctc 
gtggccccca 
cacttggcta 
tcaggtccac 
gccgccatca 
agacatgtgc 
agttcctgaa 
gggtgctaaa 
tttctgcagc 
aggcacaatt 
cagatgaggg 
ttaggaatat 
accgggttta 
ttatcacatg 
ctatctttgt 
aacactccaa 
aactgcgctc 
ccaaaaaggc 
gtattaagcc 
tattatttct 
cattctcctg 
catgctttgc 
ctcaccccac 
tgagccttct 
tcttgtagcg 
tagagttgga 
tgcacccacc 
ttgctcctgc 
agatcaagtc 
tagggtggag 
taatttgctg 



ctaaaaagag 
aaaaaaaaaa 
atttgggcaa 
attatgcttt 
agaatttcta 
cttaccatta 
tgcataatgt 
gtggggggca 
ctcaatccat 
gcagcatatc 
gacaggaaac 
gcagttcata 
gggctgcctc 
gatctcagca 
cccagctcca 
tgtcctagtg 
gttgaaaggg 
ttacaggcaa 
tcttgaatca 
cggtgaccag 
gtcaaatttt 
acggagggtg 
aatagaaaac 
agaggaagga 
gcctttatcc 
aagagctctc 
gccgctccgg 
gatcatttaa 
ttgaaacatt 
tggtgtttct 
gcgctgcctc 
gcggcccgtt 
tgctcttatc 
tttccctccc 
ttttctgatt 
catcttcatc 
cagtcatcct 
ctgagagagc 
acatgcgctt 
gcaaattcat 
ccagcatgcc 
caagcctgtg 
ctgggatcaa 
ctgcctcctt 
aaaagtaggt 
actagagcca 
agccataatg 
ccaaaggaaa 
taacatcccc 
gagcagtctg 
cgattaatga 
ccaaagaaat 
ctacaaagat 
ctcagtaagc 
gtcttcacaa 
atacttggaa 
cttgcattaa 
tctcacatgg 
gttagatgac 
cttccctcag 
tgtaaagtgt 
agaagtgcag 
cgaataattg 
tgacaattac 



ccgaggcaaa 
atatggctca 
gtttctttct 
aattttaatt 
ccctgcaagg 
agaaccccca 
gccactctgt 
aaaaccagga 
tgtagaggaa 
cttgaqtgta 
ttcagaagaa 
tgttttaatc 
tgtgtactga 
gagatagctc 
cctgctggag 
gctgtgtacc 
gctctgtaag 
gtaaggtcct 
gggatttaga 
aggacaaggt 
caagtttccc 
tgtgtgtgtg 
ttaaagactg 
acggccaggg 
gagggtttag 
ttcacttggc 
gagggggagc 
atgtttaaaa 
tatgccatca 
ataaactgct 
cagccctggg 
tctccagaga 
cgtgttcata 
gagtcttcct 
ggtccaaata 
tgggagccaa 
aaggtgtgtg 
acgtgccctg 
tccccagcaa 
ttcccccttt 
agtggctaga 
ggcagaagct 
gaaatgagca 
attagcgctt 
tggattccca 
ttaagagact 
caagagaatt 
tgagcagccc 
tgcaggattg 
cctgtgtacc 
caaaggacac 
caaggtctag 
ccaacactca 
catcctggtt 
cgacagctgg 
ctactccttg 
agctgtgaga 
gagacaccca 
ccagcaacat 
atgttcctgg 
atcttctcta 
gctagcaggt 
aagctccaag 
gccatgaagg 



ttattatagc 
tgcgaagttg 
ccaccctctc 
agccaaatag 
agctgaacta 
tccgatattt 
agcattccat 
gattaaacat 
aacacactgt 
agggtattaa 
ataaaatttc 
tggggaggaa 
accagacagt 
ctactcaaag 
aaaccttgta 
taaagttaca 
ataccagagt 
agcagatggt 
aattgctatt 
cattgtttaa 
agcagtctaa 
tgtgtgtgtg 
aaccaatcat 
cgagccagct 
tgaggctccc 
tcagtcccag 
gacatcaagc 
tatgcacatg 
tcttggatcc 
tcagcgattt 
ttggtggaga 
gccgtttgtc 
attctcatct 
ccctcattcc 
tatagacaaa 
tgtggtgagt 
agagaggtag 
cagctcaggc 
accctgcttg 
tccagaataa 
agcctgagat 
aaccttgggc 
ctggtttata 
tgaagatgcc 
cagtcagctg 
cttcattatc 
ttcattccat 
agcgtgaagc 
cctctgcgat 
tggcttgcaa 
tcatagaggc 
gccattctcc 
atttcccagg 
tgaaacgggc 
taatttgcaa 
gaactacttg 
aggaaccaca 
ggttagatat 
gccttccccc 
catcttataa 
agcctcactt 
gacggtgagg 
agttaccagc 
gaacgctgcg 
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actatgcaag 
ttaataacac 
cagcatatgg 
ttcaattcgt 
catttgactg 
gcttcccaca 
aactttaata 
gagcgtgaga 
tgagacaaaa 
atgggagttt 
ttagtggttc 
tatggagatg 
gtagaagtgg 
tgcaccccaa 
aaattttagg 
tcgagttgag 
ttttttttta 
tggctccctg 
aaccaccgaa 
tacaagctct 
aattaccgac 
ttaaaagttt 
attagggact 
aattatacat 
aggttatgaa 
cagctccaag 
ggatcactta 
ccatcttccc 
gggtctctcc 
tttttttttt 
aactcacttt 
ctcctgagtg 
tgaagttccc 
tccggctcag 
aacccccaca 
gcaaggctac 
tgtcggaggg 
tgtcctttgc 
gccttggtgg 
atgcctccct 
atttacctgc 
agagaccctc 
gcccatcaga 
gtgattcagc 
gccactttta 
gggtatataa 
cctcacgcct 
actaagtgtg 
ggtgcctggt 
gttatgtcac 
tgctctcgta 
gaactgtaag 
aaagtaactt 
ttgtgtctgt 
ccactgacat 
gtcattcaga 
cctagggtac 
ttgtggatta 
acatcaacag 
cttgaacaaa 
tccggccttt 
ttatttcatt 
ctaaggttac 
tgagatgaag 



aatgttgctc 
acagaatgag 
ctgtgcacat 
cctcagatag 
acaggcatct 
gcccggcttt 
aggattttct 
caggaatggg 
gaatattaat 
tgaggcatgg 
atttatggag 
tagcttataa 
tgggggtggg 
gcctatgttt 
ccaatagaaa 
cagtggtgtc 
aagtccactt 
cagcgagagg 
agggaactgc 
gagcaagcta 
tgggtaatcc 
acacaatcac 
ttttgggtgg 
tgagatttgt 
atggccaggt 
tcataagctc 
ggctacccac 
aacatggctt 
aatgtggact 
tntttggttt 
gtagaccagg 
ctggaattaa 
agtgacctgg 
aattttgggg 
ttgggagagg 
acagcaagaa 
tgtgaggcct 
tccattactt 
ctgcaatctg 
ccccaccctc 
cagcctttgg 
acagaagcac 
agaacccagg 
cccataaatt 
ttgaaataga 
gagataatcc 
gcctgcaagg 
gaggtatgga 
tggtggcacc 
tgggtatggg 
aggttcctgc 
ccacttactc 
gtgtggcagc 
ctgtctgtct 
gtctcagtcc 
ctggttttga 
tcgaccagac 
tttgtgtata 
tatttagcat 
tgaaaatcaa 
taatacagtt 
gcttaaccag 
ccaaccacgc 
aacaatcaac 



tctaattaag 
ccttggctcc 
aggccgtgag 
agcccagaga 
tcccggaaag 
caaccaggaa 
cacagaaaga 
gacacaagcc 
atcttggcta 
ctaagctggg 
ggcttgctgc 
aactactcag 
gtggggcaag 
agggttgatc 
tgacctctgc 
aacggagaga 
gcttgggact 
ctccagtttt 
acagcacaca 
tctgaagaag 
actagggagc 
ttttgagttg 
tttactcgag 
caactgctgg 
aaggttgggt 
aaacaagtct 
ggacaagcac 
ctatgttgct 
caaatcatga 
ttcgagacag 
ctggcctcgg 
aggtgtatgc 
atgtcaactg 
ggacaaggta 
cagaggcaga 
gctgtctcaa 

tgggggccag 

aatcagaatc 
catttaagag 
cctgcatccc 
tgggacacag 
ctgaatgtac 
cttatgctaa 
tacacatccg 
aaagatgcct 
catgttgtct 
tagccaccaa 
taaaaaaaaa 
gtttggggag 
ctttgagagt 
caccatgttt 
tcaggtctct 
cagctaagca 
gtcattccat 
tcttggtatc 
gcttgcaatc 
tgggagtagc 
aagatctgat 
caacttaata 
acagcagtac 
cctcgtgttg 
agttaactgg 
taggaaggag 
ataacaggga 



agggctctgc 
gggagctaaa 
tgatgcagcc 
gcgcggctca 
cctgcgcgtg 
ggcttggcgt 
aaagtccatg 
aacaccccaa 
tgaaggagga 
caaaccattt 
ccggagagcc 
attttaaaca 
agaacaattg 
agcttcccga 
accacggctg 
gcagaggaaa 
cacctgaagg 
cccaggcacc 
agttaaatat 
ctgtcatttt 
aggtagtttt 
actataagta 
gctgcaacca 
ggagtagtgg 
gcttccaagt 
tcaaggggcc 
ttctcataca 
tcaacaacca 
ctacagcntg 
ggtttctcca 
actcagaaac 
taccacgccc 
aagttggatt 
ccttgattta 
ggcagaggca 
aaccaaagac 
aacaaggtgg 
gccatcacag 
ctaagtggga 
tccatttacc 
gcttagtggc 
tttcagcgct 
ggagccagaa 
tacagccaaa 
attctggagt 
ttgatgtggt 
gccacaccca 
tgtcccaaga 
gcagtggacc 
ttgtagcttt 
cctctgccat 
ttcagtcctg 
agggctgtgg 
ttatatagtc 
atatattcac 
ctcctgcctc 
aggttctgtt 
ggcccgaccg 
aactcatttg 
ctgttctcct 
cggtgacccc 

aa gggttaat 

aggaaagggc 
cagagcagtc 



atttcctagt 
ggttccatta 
cagttaagcc 
ggccctcacg 
cctacactgc 
gggtctgatc 
ggaacaaatc 
ttgctaggct 
tggtgccatc 
tctttttttt 
catcagaaga 
aacagtgcag 
catctgcaga 
ggcaagccca 
actgaagcta 
gtccaatcag 
cagggcattg 
agcccatcgg 
aggctgggtt 
taatgacggc 
ggaagaacag 
tttcacacga 
acaatgagtg 
agggtcctgg 
ctcaaatata 
tggagagtta 
aggaccggct 
gggcagggtg 
gggttttttt 
tatagccctg 
ccgcctgcct 
ggccgagtcc 
ttactgtgat 
actgggcact 
gaaagttggt 
atctttcttg 
tcaaggaaga 
atatagctag 
taaactcagg 
tgtttccagg 
ttagcgctgc 
gcagagcacg 
agtagaagca 
cccacttgaa 
gctaagtggt 
gctagggaga 
caacctctat 
cctcacgaat 
atttggtctt 
gctccccttc 
tatggacacc 
gagtcttatc 
ccgactgctt 
ctgagaattg 
ttaagacaag 
tgcctcaagg 
ctcttagctt 
actcccttcc 
gtaaagccat 
agagcagcgg 
cccccccccc 
aataaaacca 
cactcgcaca 
cttgtaacaa 



cacccgcact 
ggagcacggg 
cgctaacacc 
ccacgagccc 
aaatggacct 
cttcaagagt 
ctcctcttaa 
aactctgata 
ttctgaattg 
ctcttcttaa 
gagctcgctt 
gaggccagag 
aggctagccc 
gaagcctcta 
taaataagcc 
agcttcattt 
agtagagcct 
ttggttacct 
atctgcattt 
acaaacttcc 
ttcaccatta 
ggcaggtggg 
ttttctcaag 
taatgcagaa 
ctcctaaggc 
agacaaataa 
acctccaaca 
aattaggggt 
tttttttttt 
gctgtcctgg 
ctgcctctgc 
gtcttgataa 
gactactgag 
acacgactgt 
tggaggctag 
atccaaatcc 
ccactgactc 
gagattttaa 
ggtgggccca 
gatctgctta 
tcggggcacc 
cacggctcag 
gctggcaaga 
gtgatccaga 
acaggagggt 
taacccagga 
ttatacacac 
ctgcaaacat 
gcaggaggaa 
cagttaactc 
tggtcctcta 
atagcaatga 
gggattatgg 
aaccacttta 
atctcattaa 
cgataggatc 
tctacagtga 
ctttaagtga 
ctccccacct 
ctctcagcct 
agccgtagaa 
gtctgggaga 
aacctgtctt 
gtgcaaagga 
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gagagagagg 
aacacaagtc 
attaattatg 
tatttatact 
caatttgacc 
aatggaaggc 
tgccttttac 
tagagcccct 
attgagctct 
gtccaagaga 
aatccctcac 
tttttgagta 
ccacgggttt 
ggattggttc 
tgaccgaatg 
ccatctctaa 
aacgctttgt 
gaatcaagtg 
tactaaacca 
aaagatgcca 
ggagacagaa 
aagctaccca 
cttgtatttc 
acatagggaa 
agggtgcatc 
gtgagggccg 
tgggtggccc 
gtttccaagt 
ccagtatcaa 
acacacagag 
ggctgccaca 
cagcagcagc 
agcaactgaa 
tgcacacatc 
gcagagatgt 
ttaggaacac 
gttttcttcc 
taaacatcaa 
ccttttgtgt 
atctccaggc 
tatttattat 
aagtaggatc 
tcttaaattc 
ggtaactgat 
tgtctgcagg 
caagtaactg 
ttccccaagg 
taatatcaga 
cagatattaa 
caaccatgca 
cacctgggtt 
cactggaaaa 
acttgtgaca 
aggcaggagg 
acacacagcc 
tgggttgcca 
aagccattct 
tcacatgttc 
cttcacgtcc 
tttgtgagtg 
tcaatgagtc 
tcctggagtt 
tcagacctag 
agcagacagg 



ctgagtttct 
aatataaacc 
tagtcaatag 
tttcgaagac 
ctgtagttca 
ataaaacact 
aaagagactg 
ccctcccctc 
aagtgataga 
attcaaagaa 
aggcgttaac 
gggaaatctg 
ctacatgcag 
agcgacctca 
ctgtgagcag 
cctgcaattt 
aagcaaggtc 
taactcaatc 
gtgagcctca 
acagctaagg 
acaggaggat 
cttgaagact 
tagtgtatga 
gatggtgcct 
gatctaaatc 
ttccagagat 
aggggtccag 
attaggacca 
gcttcaggcc 
cactagcatt 
gtgagagtcc 
catttctcca 
acatggggtg 
acaggccagg 
gcagcagatg 
actgagcagg 
tcgatcacgg 
tccaaccccg 
tttcattcat 
ctcagtctaa 
gaaggctaat 
cacaaagata 
tatttaacat 
gcaaactgaa 
ctttgtcgga 
accccactat 
ggacagcaag 
gtcatctgac 
cgataggcct 
gtaactctgc 
ctcccctact 
gcacactcca 
taatttctgt 
aggggactgc 
ctctgagctc 
gcctccccta 
atttctgttc 
ttcctgatgt 
atcacagtgg 
gcagagcctg 
cagcaggaag 
gggccagatc 
catcctcccg 
taactcaccc 



acttctataa 
tttagacatg 
ccatgggttt 
aagccactca 
aaaaaagtca 
ggaaatatgg 
gagggaattc 
agctgcttaa 
tgagaatcag 
ctgtgggcca 
agtggaacca 
ttaaacatcg 
ctgtgctaga 
gaggacattc 
gagatatcaa 
gtcaatctca 
agatagaggc 
cctatctcct 
agcaaagcct 
gccagggatg 
caggacttca 
gtccccaaca 
gacgaaggaa 
caaaaacagg 
cacatacctg 
ttaataaaga 
actgaataaa 
cagcctggcc 
agcggcaggg 
cacctcctgg 
cctaccttcc 
gtaacttgct 
ctgtgacccc 
cctgaagatg 
ccaagaggtg 
catgctcttg 
tctccacgtt 
aggggccaga 
ttaaagatgc 
tctctgagtc 
taattgcttt 
tactgtttat 
agctttaata 
gccatgctct 
tttactaaga 
gtggataatg 
ccagtgctta 
tcagtctatt 
ttgattaata 
aaaacagacc 
cttctccccc 
ggttacataa 
ttgaagataa 
tgccacacta 
actgtctttg 
gaggatcttc 
ttctctcttc 
tcgcgatcag 
ccttaattcg 
gcgtgcagcg 
aaggaaagtt 
ctcttccaca 
gagaagcctc 
taaagcatca 



ataaaccctt 
gggctgccaa 
cattagcgta 
gggaaaaaat 
gaacagcaca 
acagaaatca 
agaaactatt 
cactggggtt 
aacaggaaca 
gaatctactt 
atttccaagg 
tcccacgagg 
tctgctgaag 
ttgttcacta 
aggccggcta 
gacagcaatg 
tccataaaaa 
gagattaggg 
gtctgttctc 
tagtgcaggg 
aggtagtttg 
aaacaaacaa 
gaaagctcaa 
acagccgagg 
gatttaaagt 
cccaccctga 
gggaaaatgg 
cctggcatgt 
cactggacag 
tctcttcttg 
tcaccgttaa 
ttccacagat 
ttggcaccac 
ctgggggact 
ggctgcagca 
ccgaatgaaa 
tcagagttgg 
tcatcggtgt 
attccagggt 
tgtaatgagt 
ccagttacaa 
tcaaacaaag 
aaggtacaca 
gtagcagcct 
ttctgttatc 
aagtaaatta 
tcagccgtcc 
aaacctatca 
attctacctt 
ctttgattcc 
caggaggaac 
tttgcctcat 
caaaatttca 
ccggtggctg 
cttatcagtg 
attgtggagc 
cgttctggcc 
ccgtctgcca 
agattcatcc 
gaagagagaa 
tacaagtctc 
gaccctttcc 
tgtctttcta 
ctttcatcta 



ggcaggcgga 
acttcacttt 
ttaaatacca 
ggtgggggga 
ctagagatta 
gatccctgcc 
taaaataaag 
gtggtggacg 
gtgtttttga 
aggcag.tcct 
cagccctgct 
gagcccagct 
tggccggtga 
gccctcgtgc 
ctggactgaa 
aagactgtga 
ttgttcaggg 
aagggaagga 
agcaaggtga 
tgctgtgata 
ggctataaaa 
gctgggtatg 
gcccgagacc 
agcagacaga 
aatatctggg 
ctgagtatgg 
gaggaagttc 
gctggccagc 
ttcccacaca 
acaacagata 
ggatggtaca 
actgttatag 
aaagccatgg 
gcaatgctgc 
accagagata 
agcctcgcag 
cttggtgtta 
tcctgggctc 
tgcaaacatt 
taacatcttt 
gaatccttta 
caaaggaaac 
ggtccgcctg 
ggatgtccca 
ttcaaacagg 
tgcaatttgg 
tcagaggaga 
aaccctgaag 
gttgccattc 
aggcagacgc 
tcaagacaaa 
tatccagagt 
tgaaatccga 
agaactggag 
agtcccaaga 
tgtcccatgg 
accagtggta 
tagtctctga 
tttagaaaag 
ctttctttgc 
agagagaaag 
ccatcctagg 
tgggtgcagt 
gaggagctct 



tcactaaagg 
tcgacagtat 
cgatcaatat 
ggaggaggaa 
gcaagggttt 
ttcatttttc 
gcaaaatgat 
caaaataagc 
ggcaaaatat 
ctgggacccg 
ggtgatctga 
ctttcactcc 
ggaggtgtgg 
actggggcga 
aactagatca 
ttttctagtc 
ttcaggcaga 
aggctgtgtc 
gccacccacc 
tcaacagctg 
tataagcttg 
gtggtcgatg 
tgcttgggtt 
cagggcagac 
agactggtct 
gcaacaccca 
agctggtagt 
tagtctagct 
cgacacacac 
aaatgtaact 
aactgtgagc 
cactaagaaa 
caagctgaag 
ctggattctg 
attaatatga 
tgtaatgact 
ggctgccgcg 
aatcgccttt 
agtgagaatc 
ccctagtgaa 
cagtcaaaga 
aaagcttctt 
gcaaccgaac 
gtgccacctc 
gattgtgtct 
gggtttgctt 
caattctgat 
gaaggatatt 
taagcattaa 
accctctgaa 
aaggtgccac 
ggggttaatg 
caaagccgga 
cggaaggttc 
ggggcccaga 
ggcgggaagg 
cttgctccca 
agtccacggg 
agagaagctg 
ttcagtggct 
tgctgtgact 
tgccctgtgc 
gggggcccag 
gtggtagtag 
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ggactgaggc 
tccatcccag 
ccagcctacc 
ggtgataacg 
cacttgtctc 
cacaagtcta 
ccgctgttta 
ctccatggct 
aagcttgcaa 
ctagatagat 
ctggacatca 
tttgtttaat 
annnnnnnnn 
nnnnnnnnnn 
ctcctgctgg 
cacagtgtca 
gaactcgagt 
gattccctgg 
tgtaccgatg 
gagaacagca 
gagggggggt 
aagttccaca 
agaaacagat 
ttttctcctt 
tatatctaag 
cattatggat 
agaacagtca 
tgagttaggt 
caatgatact 
ttgctggtct 
gttatgtacc 
gtttctctgg 
ctcagaaatc 
cccagtttaa 
ctggatgttc 
cctctgcctc 
ttttttaatg 
cccagagcgg 
ttttaaaggc 
cacacccgta 
tgctggtctg 
cagtttgagc 
gcagagaagc 
aacaggaaat 
cttctctact 
aaattaaact 
gcctggttct 
gcctcactag 
tggcctgttc 
cagaaccaaa 
ctcttcagca 
acgtgcggtt 
tggagctcag 
tgcggagggg 
ctgcacccta 
cactgcgagg 
aatggcccag 
cccttcctca 
tgatgcccgg 
gcgttggctg 
gggtgaagca 
gactcgagtg 
cgtgcccttt 
ggctgcccag 



ttctgctcca 
gaagacagaa 
tatgttcatt 
cagaatgcac 
cttgaactaa 
gctgggggcc 
caagtatggt 
ttaggttcag 
caacagctcc 
ggggcaggac 
ggattagttt 
attctgagtt 
nnnnnnnnnn 
nnnnnnnnnn 
ggttacaggc 
agctggatat 
tggtgaaata 
tcacatgctt 
catgcttgca 
agtgaacaaa 
gtaaagctct 
tgtaaggtca 
gcagtctgtc 
cctttctttt 
tacactgtag 
ggttgtgagc 
gtgctcttaa 
tttgtgtagc 
ctgcctgatc 
ggaactaaca 
accacacctg 
gtagtcctgg 
cgcctgcctc 
gggttttttt 
tggaactcac 
ccaagtgctg 
tgtgtatttg 
aagtgttctt 
accaaactct 
cacacccgta 
gcctgtgtat 
acaggctctc 
atctgcaagg 
ctcaggagag 
tcccagaaca 
ccttggtttg 
gctgctggca 
atccgcgtct 
aggatgatct 
cgagcctcgg 
ggggtgggga 
gtttccgtgt 
cgctggtgct 
taagaaaacc 
ccccccgctc 
agggccgtgg 
gggagcagaa 
ggacctgaca 
cagctggccc 
gctagtctcc 
gaagtcagga 
gaggtgcatg 
gaggaggggt 
aattaccgga 



gctctgggca 
aatccctaca 
taatacaaca 
aaactctgcg 
acagtgtctt 
ttcatgcttc 
taggagcccg 
agtcattctc 
aacacagaaa 
ggactgtcca 
aaacacttgt 
ttagaaccta 
nnnnnnnnnn 
nnnnnnnnnn 
ttgagccacc 
gatggcacat 
gtccaccagg 
taaggaaaga 
gccacccaca 
caaacaagcc 
tggcaggtgg 
tgtgatcgag 
atatctaaac 
ttctctccct 
ctgtcttcaa 
catcatgtgg 
ccgctgagcc 
cctgggtggc 
tgtcttaatc 
gagatctgcc 
gagtttaagg 
ctgtcctgga 
tgcctcccaa 
ttgtttgttt 
tctgtagagc 
ggattaaagg 
tatgggtgtg 
aaccgctgaa 
taggtaggag 
cacaccacat 
gagctggaac 
tggtttctgt 
ctgtgaccac 
tatttccttt 
ttgtgtgcca 
cagagtagca 
gagctgacac 
cagagtctgc 
tcctctgcat 
acgaagcccc 
caagagcaag 
ctgggagatc 
tcgcgctccc 
cccaggcttt 
ctcgttcatc 
ccaggctcag 
ggcggaaagt 
tggaggagct 
tggcgccggg 
gggagcgccg 
ctttactgca 
ggctgcggaa 
cctgtagccc 
acctgtggcg 



aggttacttc 
ctctcccttg 
actaaaatat 
gctgcagggg 
tgaggcagaa 
catgtgctta 
actgcctggg 
tgcatgcctc 
gtgctgtgag 
gtaagcaggg 
caggtggggc 
aactgtggga 
nnnnnnnnnn 
nnnnnnnnnn 
atgacagctt 
gcttgcaata 
taaaagagct 
acttgccgaa 
cacccacaca 
gggggggggg 
ctggctcctg 
tacatctggg 
cattgttgtc 
ttctctttta 
acacaccaga 
ttgctgggat 
atctctccat 
gttaccttaa 
attttgagat 
tgtttctgcc 
gttttttgtt 
actcgctctg 
gtgctgggat 
tttcctgaga 
aggttagcct 
cgtgtgccat 
tgggtgcttg 
ccatctctct 
actatacaca 
gaccatgcct 
caaaaccttt 
tctctgtcct 
gctggctggt 
taagaacgcc 
agtggcaagt 
tgggagcatt 
ttggctaaag 
aggagaaatc 
ttaagggcgg 
ctaaaggcag 
aggcgggatc 
acatgacccg 
cgccctgctg 
ctttcctttg 
ccagtcttcc 
ccttgcgctg 
ggttcttaca 
gctccggagc 
gggcactcat 
actgggaccc 
acgttcggta 
acggagactg 
tgaaactctc 
ccatgcatac 



tctgctcttc 
atctacccga 
ctattcacag 
agacggcaga 
cagggtgaca 
gtaattaatt 
ttggcctctc 
tgcctgtctc 
ggtcgacagt 
ttcatcatgg 
acttttacca 
aacaagagtc 
nnnnnnnnnn 
ngccttgaac 
tagcaatagc 
ctaacctcca 
tgctgcccaa 
gatgtcctct 
agtgcactgt 
ggattgtgac 
gtaacactcc 
cctccaacag 
gtatctctgg 
aaaaattatt 
agagggtgtc 
ttgaactcag 
■ cccccaaccc 
ctacactggc 
agggactcac 
ttgcaaatgc 
tgtttgtttt 
tagaccaggc 
taaaggcgtg 
cagggtttct 
tgaactcaga 
cactgcccag 
tggaagccag 
tccctcttcc 
cacacacaca 
gagcacacaa 
gtcgggagat 
gtgtcgcatc 
gctctgccat 
agacttttgt 
tattaaccaa 
gagagggtgt 
ggctggcatt 
agagagggga 
ctggtttgcc 
taggagagac 
tcgcccggcc 
catcagctga 
cgccccggag 
tcgctggttc 
cggcctggca 
cccccaggcg 
gcagggtccg 
gtggagagag 
gtagtggccc 
tgtccccggg 
cagaggctgc 
tcctacccgc 
actcggttca 
cacacttatg 



accattcctg 
ctttctgaca 
gcactaagct 
gttcctcctc 
cctagggaca 
actacatgca 
gcctctgcca 
tccgttggta 
ggatagatgg 
ctatgcagct 
gcacgtgcta 
cacacataac 
nnnnnnnnnn 
ttgcttctgc 
tttgtaaatc 
aagattccct 
gcctgaatct 
gagtgccacg 
ctcacacagt 
cagaataatt 
ataagtgggg 
tccttgnnga 
gtagtctttc 
tatttattat 
tgtcagatct 
gaccttcaga 
ctttctcttt 
tttgaacttg 
tacatagcct 
tgggaataaa 
tcgagacagg 
tggccttgaa 
tgccaccacg 
ctgtgtagct 
aatctgactg 
tgattttttt 
gtgtcagatc 
ctaactctga 
cacacacaca 
gtggttttat 
ccgcagtctg 
ttgactagag 
ctacattttc 
gcctgggcca 
gtgctttgga 
atgcctaaag 
tctgagatga 
gaaggtccag 
cacgtagccc 
tgagccttgg 
ctttagagac 
cccgtcacgg 
cgcaggaccc 
gcgcagtcac 
ccccggaagc 
gccaggacca 
agggctggtc 
atctgaacat 
tagtgtccac 
ctgagggcct 
ccccaggctg 
tgggtggagg 
tgcaggaggt 
gacagcctta 
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cagccacagc 

gagggtgtat 

cagggatggg 

ggctttgctg 

gcatgatgta 

tcttctcttc 

ggctctgtct 

cctgagcgct 

tgggacaggc 

gaaagaactt 

cctcatgcag 

gctgccctgg 

caagttccga 

ggcatttgtt 

tgacgtgctc 

gctctgtgga 

gatgcagacc 

ttgctctatc 

ggaatttgtg 

tcactggatt 

gaatgtgtgt 

gctatttgat 

tccactcatc 

ttcaggacaa 

tggctggact 

tctggattcc 

agcatcacct 

caaagctgcc 

cttcagtccc 

cctagggagc 

gagcctcttc 

caccagggtc 

gggtctctgc 

actcctacag 

actgtttgag 

cagccccttc 

tcttttatat 

tctgtggcag 

cctgcctttc 

cctatttgaa 

gcctctcatc 

cgactgtttt 

gctgccccat 

cctggtcggg 

tgcctttggg 

cccagactac 

cttccaagct 

cctgggccag 

tggggatgat 

ggacttgaag 

ggaggaggac 

gctgtccatg 

gccgctgaca 

ctgagggggc 

accagatcca 

ctgtgtgtga 

aaagccatac 

tactcgaggg 

aagtggtggt 

aagagtgttc 

gagaggcagg 

ggtgcaggcc 

gccagcctgg 

tcgaaaaaaa 



actgccccct 

ggatgcacct 

ccctgcccct 

gagtcgcccg 

gttaccttca 

cgtgttctga 

ttgcaccaca 

tacgagatgc 

attaagtctg 

cggggccttg 

ctgaatcggt 

gtggtggact 

ctcaacaagg 

gcaggtggtg 

tctgacatca 

catgtccgag 

tggacaccgg 

caccctgaca 

gctgcccatc 

gatcttacct 

ctgcacctgg 

cagccacacc 

ccccggctgt 

cttataaatg 

agagataggc 

atctccctcc 

ggcctgttgt 

gggttggacc 

atacaggcct 

cagttggagg 

catcgagaca 

cggatactgc 

atacgccact 

ctgagcggac 

tataggccgg 

agctccgtgg 

caggcccggc 

cagctgggtg 

gtgctgtcgc 

cccgttgcca 

ggtgcctatg 

gtggcccagt 

gtcctccagg 

accactgagg 

gaagagattc 

aggtcgggcg 

ggactctacg 

ctgagtgata 

ggcggtgccc 

cagagcgaag 

caggaggatg 

gagacggtgg 

gagcagacag 

atgggtcagg 

ctgtagcacg 

gtggaaagtg 

cttgggctga 

ccctgggtgt 

tgagatgtga 

cagacaggga 

gaaaggctaa 

tttaatccca 

tctacaaagt 

aaaaaaaaaa 



cagctctacc 

tcttgccagt 

ctcggggcag 

agatgctcta 

gccctgccaa 

gggccatgga 

ttgctgtaga 

cttccgagga 

aaaaagaggg 

tgctagactg 

tggcaggtcg 

ttaccacacc 

gagataagca 

caggaagtgg 

cgtactatgt 

cgcagtggga 

atgagtgcat 

tgcccgacct 

gagccctcct 

ttggctacaa 

tggacgctca 

cccaacgcct 

tggtccagcc 

gtgcgggcag 

ctgggacagg 

ccgggaaagc 

ctttttctgc 

ctggggaggg 

tggaagagct 

agcctgaaaa 

tgcaggtcct 

agcctgatgc 

ccaaggacat 

ccaaaagtcc 

tttcccaggg 

tccccttccc 

gtgtggagga 

cggtgttaaa 

tcatgtctga 

aggccctggg 

agagcccctg 

tggtggtgcg 

tactggctgg 

atgaggaaag 

agatggatgg 

tcagcttcca 

tggctgaatc 

agagcagtac 

ctgcggacaa 

gctctgagga 

aagtcacggg 

tggctcctgg 

aaggcaaaga 

tgcttttcct 

cacagccaca 

gggcttagat 

ggctgggagt 

cttatctcta 

atggagccag 

acagtgtgtg 

aatggttttc 

gcactcagga 

gagttccagg 

aaagagtgta 



tgccctagac 

gggtgaatcc 

ccctgcctgc 

tgtggtacac 

gctgaccaac 

tgcctgtcac 

cgagaagcta 

tgaaaaccag 

ggaagggaga 

ggtccatggc 

acggcagggg 

ttatgggcgc 

attggacttc 

ggagccaccc 

atacaaggcc 

accccacgag 

acccgagttc 

ggatgtgccg 

ggagagctgg 

actccagggc 

cacccatctg 

ggctggatct 

tattcgggag 

gcttgtcgta 

agaagatgat 

aggtgaccag 

accctcgggg 

tgaagagggc 

ggagaaagtg 

gcctcacgcc 

gggtgtcctg 

acctttgtgg 

ccccgtgtct 

catggtgtcg 

attaccccca 

tccatacttc 

tgaggtccag 

tgacatcact 

ggagcacacg 

ccccaaaaat 

ccgcctgcat 

gctgggcttg 

ggtggaggct 

tgagctcccg 

gcagccggct 

tgaccaggcc 

tccacagccc 

cagcgaagcc 

gaacagcgtc 

agaggaggag 

aacatccgag 

tgatgggaga 

acaaaagatc 

tcaggctctc 

gtcagacaca 

ttagctttca 

ggagttgagt 

gccccagaag 

aactggccgg 

tgtacctctg 

ttaagagagt 

ggcagaggca 

acagccaggg 

gaagggtgga 



tctatacgac 

atcccatgtc 

cccagccttt 

ccttatgtgc 

agccaagcca 

cgccaggggc 

tgcagtgagc 

gagggctctg 

actgagtgtc 

cgaatcagca 

gatcccaact 

ttccgagacc 

acctatgaga 

catgttcctc 

cgtcgcacac 

tatcctgcca 

tacacggacc 

gcctggtgca 

gaggtgtccc 

aaagaagctg 

accagctatg 

cctgccctgg 

gccacaggcc 

gaggccactc 

ttagaacagg 

ccaggctctt 

tctcgaccag 

aagattgtcc 

ggtaacttcc 

cagccacctg 

ttggctgaga 

gtacgctttg 

ctgcagcctg 

aagaagggca 

cccagcccag 

ccagcactgc 

ggtcgggagc 

cccgagggct 

gctgtgtaca 

gccaacaagt 

ggccgcttct 

caggccttcc 

tcccaggagg 

gtgtccgggc 

gcttcctcag 

gacctgccgg 

caggaggctg 

tcccagggcg 

aagtcagggg 

gaggaaggct 

ctcactctgt 

gacagagaag 

ctccttggtg 

atatgctggg 

gtgcatggaa 

ggagacagaa 

ggtagagcac 

aagtattaag 

aacagtcggg 

aggctctcat 

ccagaagggc 

ggcggatttt 

ctatacagag 

agccagggac 



aagctctcca 

tatcaaatgt 

tgcgagctga 

aattctccct 

aggtgctctt 

tggcctgtgg 

tccggctgga 

aagagaaaaa 

ccacctgcca 

acttccacta 

atcacccagt 

ttcgtaaatc 

tgacccggca 

accacatctc 

cgcgctcggt 

ccatggagcg 

cctctatctt 

gttctaacca 

aagacctgca 

tgaaggagaa 

gcgtggtaca 

cccctgaacc 

aggaggacat 

catgtgagac 

ctacagaagc 

cctccagtca 

gccgtaggag 

ttccagaggg 

tggccaaagg 

tgcacctgca 

tggtgtttgc 

aggctgttcg 

tgctagacac 

agctagaccc 

cccagctcct 

acaagttcat 

tggcgtttgc 

tagagatcct 

cagcctggta 

acctcctgaa 

acctgtacac 

tcacccacct 

agggcaaagg 

ctggctcctg 

gactggggct 

acacggagga 

aggccgtgag 

aggagagggg 

acagcagcca 

gtgtggtgtt 

ctgacacgat 

aggaagagga 

agcccgtggg 

tgtgggtcca 

tgtggaagtg 

agctccttta 

ttgtctggta 

aaataaaagc 

tggaagtggg 

ggttccatca 

tgggcttggt 

tgagttcgag 

aaaccctgtc 

aagtctgtac 
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aagaaggaac ttgggagcat tgccgaaagg atgacctctc tgcaggtcct gcccgaggag 205860 

ccagtttctg gggaccttga ccatggctag gtgaatggac ccaggatggg atggtcaggc 205920 

ttgctagcag agccacagcc gagttggctg ggtggggtgg ggtggggtgg gaagggtgag 205980 

ttatctgatg agctcaggac cttttcctgc cctgcagata cagcctgcaa gatggtccgc 20604 0 

tggctgtctg ccaagcttgg ccccacagta gcctctcgcc atgtggcccg gaacctgctg 206100 

cgcctgctga catcttgtta tgttggtaag gtctgtggtt agtgctggag accaggttcc 206160 

ccagccaggc ttctgcccat ccttagccct ctctaggcga ctccttccct aacttcccag 20622 0 

cactccctga gcagggcctg ggtctcaccc attaagctgg gttttcttgg gtaagtgggg 206280 

aagagcccag tattgaatga atagaagcca ccccacagtc tcagaaggqc ggcttccctc 206340 

ctgccctcca ctggcttctc aacgctgctg cccttccttg gtagggccca ctcgacagca 206400 

gttcaccgtc agcagtgatg acacccctcc actgaatgcc ggcaacatct accagaagag 206460 

gccagtccta ggtgacatcg tgtcggggcc tgtgctcagc tgcctcctcc acattgccta 206520 

cctgtatgga gaacccgttc tcacctacca gtacctgccc tacatcagct acctggtcag 20658 0 

tccctggttc gtcaaacccc ggcttggggg tgggggcaag gatccaagga ccagccccag 20664 0 

gtcttggggg ttccaggagg tctgtggggt gacctgtccc tccctcatct attctgtggt 206700 

tctaggtagc cccagggagc aactcaaacc ccagccgact gaacagccgc aaggaggccg 206760 

ggctgctggc agcggtgaca ctgacgcaga aaatcatcgt atacctctct gacacgaccc 206820 

tcatggacat tctgccccgc attagccacg aggtcttgct gcctgtgctt ggcttcctca 206880 

cctccttcgt cacagggtag gcccctgctg cttgggagag ccacctggct gagggggccc 206940 

ccaggaaggg ctaggaagct cagggagaag cagataccgg cctgagtcat ggttctgatg 207000 

ttgggggtag tggcacaggt ctttcattcc agcacccaga ggagggcaag tttctgtgag 207060 

tctgagacta gcctggtcta cagagagagc tccaggctat ctaaggctcc atagtaagac 207120 

tctgacttaa gaaaagagtc gtggttcatt ctgggttgtg ggtgtggctt ggtgatggga 207180 

cactttccca gcatgcagga ggagctatgc ttgagttcca gcccttcaga aaaacaaaaa 207240 

tgggggctgg aaagaatagc tcagggttta agagcactgg ttgctcttcc agaggatcca 207300 

ggttagattc ccagctgcca catggtagct cataaccatc cggcagttct atggaacctg 207360 

ccaccctcct tcggtctctg tgggcactgc aaacatgtgc * acagacatac atgcaggcag 207420 

aaaaaacacc catacacata aaattagacc aaaaaagttc atgttctctc ctacctgtag 207480 

ctctgactaa gctacactgc ttccctgtgc ctcagtttcc tcccctggtc tggactgatc 20754 0 

agccttacat gcagctcctg ttatttgaag ttcctggtaa attggtcaag tccttcaggg 207600 

aagggctggg aactcttgca ctttgattct aggttcccca gtggggccca ggcccggact 207660 

gtcctatgcg tgaaaaccat cagtctcatc gccctcatct gcttgcgcat cgggcaggag 207720 

atggtccagc agcacctgag tgagccagtg gccaccttct tccaagtctt ctctcatctg 207780 

catgagcttc ggcagcaggt aggcaggcag cttctgggct gggtgggcca ggccaggcca 207840 

ggccaggcca gggcagtgga cccactgaat ctgtggtctt cctacccgca ggatctgcca 207900 

ctggatccta agggctgtac tgagggccag ctgccagagg cgaccttctc tgatgggcag 207960 

cgacgaccag tggaccccac cctgctggaa gagctgcaga aggtgttcac cctggaaatg 208020 

gcgtacacaa tctacgtacc tttctcctgc ctgttgggta ttgcccatca cgttcctttg 208080 

cacagagttg gtgactacat ctcttccctg gggtgggccc cgatgctttc acctccagag 208140 

tcagcaatgg aatcttttta tttttatttt gacatggggt ctcatttagc ccaggctgac 208200 

ctttaactcc agctccttcc agcttccacc gtctcctgtt ggcattgtag tcatgtggca 208260 

ttgctcaggc ttcttncatg ttcttatttt taaatgacct gtgtgtgtgt gtgatatctg 208320 

tgtgagtgtg gaggtgacag aataacagtt ggggggtcag cagatgcctt gcctgatgag 208380 

catctctcta gatccagttt ttggttttgt gggcttttat gtgtgtgttt gtttgtctgt 208440 

ttttgtagac agggtctctc tgtgtagcct ggccatcctg gaactcattc agtagaccaa 208500 

gctggccttg agctcacaga gattcacctg cctctgcctc ccagtgctgg gattaaaggc 208560 

gtgtaccact cctgcctggc tttgtttttg ttaaccacca tcctcctgcc tcagcatctg 208620 

cctcccctgt gctgggatta caggtgtgtg ctatcacacc cagctaacag tggatttaaa 208680 

cgtaggaatt ttaggatcag agtgaccaga tttggtccta gggcccaatt tccacagtga 208740 

ttatctatct tagttaggat ctctgttgaa aatcatggtg gaacatcatt accaaatgca 208800 

acttggggag gaaaaggttt attttgtctg acaactctca ggtcaccaag ggaagtcagg 208860 

gcaggaactc gaggcagaag ctgaagcaaa agccatggaa gaactctggc ttgttcctca 208920 

tggcttgctc agtctggtgt accccctccc cacccacctc cccacaatgg tttctctgct 208980 

tatgcctggc tgtcctagaa ctcactctgt agactaggct ggcctcaaac tcaagagatc 20904 0 

cccctgcctc tgcatctcaa gtgctaagat taaaggcggg tgccatcacc cctgccccag 209100 

gggtggcact acccactgta aattggtccc ttcccatatc agttgttaaa taagaaaact 209160 

cctccatagg ccaatctggt gggggaattt tctcagttga gggtttctct tctcaaatga' 20922 0 

ctgtagctga tgccaaattg ataaaacaaa tctcaaacca ccaccaccaa caacaataaa 209280 

accaaacaaa ccaaacaact aaccaagaca gtgacttata aagagaatct gaacattttc 20934 0 

cagcaggaaa ggctcaggag ctggccattc aagtctgggg aacagaatgt aggggaatat 209400 

gatggtctcc agaagctacc tgcaaaggaa tgaacagctt gctgggtttt gtggcttccc 209460 

ttatgggatg ggcgctgtac tgggcttctc tctgagtagg atgggccacc ctgtagttgg 209520 

gaatattttg ctcctacaga attgtaagtt cccagaggca ggacacatct gtcttattct 209580 

tcattgtgtg tctgatgcta gaatggtgcc tggcatacac gtgtgtgtct ctatagagac 209640 
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agcactcatg 
gacgagaact 
ctaagctctg 
ccctgtaggt 
agggctctat 
catggctagt 
ccactcaggg 
gccccagagt 
caacaggaat 
cgggaactgg 
ccacttccac 
ggccgccctg 
ctggccgctg 
ccagcaccgc 
ctgtgatggg 
tgttcgacgg 
acgtacaatg 
ggacttagaa 
tcagttgctg 
agtatggctt 
tggaggaggg 
cagactgttc 
tgagattgtg 
tttgaagtag 
aagacccttc 
atgcctgccc 
gactgcagga 
tctaggtacc 
cagccctgtg 
ggagggtcag 
gcagtgatgg 
tgggtggagg 
gtgttgtggc 
ttctacgagg 
tgaggtccta 
ggtgtccctt 
agctcaggaa 
cctgatgctc 
ggatcagggg 
ttactgcctg 
gcaggaacag 
cagagctcag 
gcagtgggaa 
acagggaagg 
gtctctaaat 
tagtcccagc 
atgccaggtg 
cagggggttc 
acacacacga 
catgttagta 
tttatttttt 
atcgcaacac 
acagagcaag 
acaaaatatt 
ggctgtcctg 
tgcttctgcc 
ttatagattt 
aggggtcaga 
ccatgctggg 
ctagagtttt 
tttcgagaca 
gctggcctcg 
gtgcgccacc 
ttgttttgtt 



tctacgtatc 
taataaccca 
ggtagatagc 
gacatcatcc 
ctggaaagca 
gccggccctg 
acctttggga 
cctgggccac 
gaagacaacg 
ctggcgtact 
cagatccgcc 
agcagtgaag 
tacaactatg 
aaaagcgtct 
gcagtgcacg 
ctgctttact 
gcgtattttg 
tctgtgggac 
caggcgtctg 
tggcttcttg 
tcagggcaga 
tggggctgga 
ttttctgagg 
ccagggctct 
gcacagtgga 
cacacaccag 
agccaggctt 
agtcaggaaa 
tgggtggggc 
gcctactggt 
atggcgacgg 
gctgaaccct 
tggcttctcc 
ctggccagcc 
tcctttcatt 
ataactactc 
ccagctgcac 
cacagcgatg 
tcacgccctg 
gagaccactg 
gcgcaggcct 
caggttctgg 
ctctcctccc 
caggctgccc 
cccaacattc 
tctcggagtc 
ttctttgaga 
aacacccttt 
acacaaataa 
cgattggatg 
ggtcttaaaa 
tcaggagcag 
tttcagggca 
taaaagattt 
gaactcactg 
tcctaagtgc 
ggtgttttgc 
ggcgagtatc 
actgggtcat 
tttcccccct 
gggtttctct 
aactcagaaa 
acgcctggct 
tttttctttt 



gataaaggaa 
gagtagccca 
taccttgcca 
ggaaaatcat 
tgagcccgag 
aatgggaccc 
gtgtcctggt 
tgggctccct 
ccctgaagcg 
ggcagtacga 
tgcagagctt 
acttctttct 
gggacgggac 
tctacgtggg 
tctgggaccc 
gtgccttagc 
acggggaaga 
ccaagtttga 
ccttctacct 
tacgacgtca 
cggaggtcgc 
cccttgactt 
gtcactgggg 
ttgatagcca 
tccttcagac 
catcaccatg 
gcaggtcagg 
gacactcagg 
gctggcacgg 
ttcgtgggcc 
gcacttgaac 
gggcttgttc 
tcgggcttca 
catgaagggg 
tctacttagg 
ggggacattc 
aggaggcagg 
ttctaatgag 
tccgtgaccc 
agcagagtcc 
gcgcatcgtg 
gaggggagaa 
ccgccgtcca 
ccagctctat 
taagaaagtg 
ttctctgagc 
ggacctgagt 
atggcttctg 
aaataaatgt 
tggcagtgcc 
ttttaaaaga 
aggcaggtgg 
gccaaggtta 
acttactt.ct 
tatagaccag 
tgagattaaa 
ctgcttgtgt 
agctcttcct 
ttgcaagagt 
ggctgtcctg 
acataggcct 
tccgcctgcc 
cagaatctgt 
ctttttttgt 



gctgttttgg 
gtcagtacct 
tcttccctga 
ccccaaccat 
ctctcgaaac 
tcagagtggg 
tggaaatcgc 
ctctggagtg 
ggagctgcct 
gatcggtgtg 
cccagggcac 
gagtggcagc 
caatgagacg 
ccagcttgag 
cttcacaggt 
caggcctctg 
ttcagtgagg 
atccactccc 
gtaagaacca 
cctgtcgtcg 
agctagttag 
cactgtggaa 
aatgggatgc 
ctaagtcccc 
agccgggtgc 
gccagctccg 
aggggtgcag 
ggactccacc 
atggggcttt 
tgaatggggt 
acaatctcct 
gctcgttggc 
tggtgctcct 
acattctaca 
gcctggtctg 
agtggggtgg 
ctaggggcag 
taacccttgt 
agttcaggtt 
atgccccctg 
ttcctgcctc 
atagggctgc 
gatggaggtg 
ccaggacccc 
tcaggatggc 
accagttctt 
ttggttctca 
tggacacata 
taaaagaaga 
tcacagaatc 
ttggtttagg 
atctctatgg 
cacagagaaa 
tatttgagct 
gctggcacct 
gtagtgttat 
atatatgcac 
ggaactagag 
tacaagtact 
aagtagaatc 
ggctgtcctg 
tctggctctc 
ttttaaatga 
ttttgttttt 



tgccttggct 
tcttagaact 
gagttggtcg 
ccagccagca 
agctgtctcc 
atccagatcc 
ggtagtagcg 
cggagtgcqc 
agccagcagg 
acgggggccg 
aaggaccgga 
gcttcccgcc 
gccccgcagt 
gagcgggccc 
ggaacgggac 
caggaagaga 
ccaacttacc 
aaaatttaga 
ttgtaaagag 
agcatgctat 
gcagcaagat 
aggtgtgggg 
agatgtgtcc 
ccctgacggc 
actccactct 
ttcctgggct 
aggaacgctg 
tctcttccgg 
gagctgcagt 
cctatagcat 
cgtcagcccc 
agatacccgc 
gatcaaggtg 
ggagaggaca 
gaaaatggcc 
gaatcagggc 
ccatatttgt 
aaataaagcc 
ctgggctgtc 
ctatattcaa 
ttgtgggagg 
aaagacaggc 
aggggaccct 
tctggggtca 
ttcctatggg 
gcactgtcta 
ttcttatgtg 
cagcggcacc 
tgtgggactt 
agtggtggta 
gtttgaggcc 
ctctttctca 
aggatctccc 
taaactcacg 
accatgcccc 
taccttcatg 
ttatggaagg 
tctgagccat 
tgttcttgtt 
gaactcactc 
agaatactgg 
gagtaatagt 
tggctcattt 



caggcttaca 
tctgttgttt 
ttccccactc 
gggagctggc 
tggaacccac 
aggacgatgg 
ctgactctca 
gaggcctcag 
atgggctgag 
atgcccactt 
tcaaatgcgt 
ctgtgcgcct 
tcatctatgc 
atgtggtgag 
aggtgaggcc 
ctagtgcgaa 
agaagagtca 
agcaatcggc 
agattccacg 
aagtatcgag 
gtgaagagag 
gagaaagccc 
tgagttggag 
tttttcagga 
tgtggctgtc 
gcgctttgtg 
actgggggtc 
cagtgacagg 
ggatggagtg 
agggtgggtg 
gagttccgac 
agtggccgga 
acgggcctgg 
actgactgcc 
ggtttatgct 
ctcgtaggcc 
tagaactgac 
cttgcttgga 
aggaggctgt 
ctgatggggg 
tcatagacct 
atttctccct 
actgttgctt 
gggtctcagt 
tcctgggtgt 
aagtaaggac 
gctctggctc 
gcctgcacgc 
ttgtacctca 
tatttattta 
cacaccatta 
agcctggtct 
aaaaataaaa 
tattagccct 
aagatcctcc 
actattttct 
cagtgctaat 
ttgggagtca 
ctccagcccc 
ttgttttgtt 
tgtagaccag 
aattaaaggt 
tacaggtttt 
gttttatttg 



209700 
209760 
209820 
209880 
209940 
210000 
210060 
210120 
210180 
210240 
210300 
210360 
210420 
210480 
210540 
210600 
210660 
210720 
210780 
210840 
210900 
210960 
211020 
211080 
211140 
211200 
211260 
211320 
211380 
211440 
211500 
211560 
211620 
211680 
211740 
211800 
211860 
211920 
211980 
212040 
212100 
212160 
212220 
212280 
212340 
212400 
212460 
212520 
212580 
212640 
212700 
212760 
212820 
212880 
212940 
213000 
213060 
213120 
213180 
213240 
213300 
213360 
213420 
213480 
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ttttgagaca ggtctcactc tgaaccccta ggtggcctgg agcttgctat gtagaacaca 213540 

ctgactttaa acttgttttc tgagtgctgg atttatgggc ttgtgctatt ttgcccagcc 213600 

tctgatggtt gttaataaca atattattta gcttttcttt tggagatagg ctctcactgt 213660 

atatcaccca gacagtggct ggtctggaaa tcactgtgta ggccaggctg accttgaatt 213720 

cacagagatc tgcctcccga gttcagaaat taaaagcact ctgggatggt tttggagttt 213780 

ggtgagtacc caagcctcca ttgatgctat ctgtccctcc cgctctctgc aggctgtaga 21384 0 

gggcagcgtg ctcatcagct cctcttccga ccattccttg actgtttgga aggagctgga 213900 

acagaagccc acgcaccact acaagtcagc gtccgaccca atccacacct ttgacctgta 213960 

cggcagcgag gtggtcaccg gcactgtagc caacaagatt ggtgtctgtt ccctgcttga 214020 

gccaccctct caggccacca caaagctcag ttccgagaac ttccgtggca cgctcactag 214080 

tctggctttg ctgcccacga aacgccacct cctgctgggc tcggacaatg gcatcatccg 214140 

cctcctggca tagggccagc caggagttgg ctgagggcag ggcgagatga catctctcag 214200 

ggcccgctcc tcattcttga tctcgaagcc gattcttcta ggcaagcccc aggctctggc 214260 

tacccacatg gcctgctgtc tgggattgca cagctcctga atctccaaag ccttgaagtg 214320 

gcttcatgaa actcgggaga tactgttcct aaccagcaag aattggggca aggaaagcac 214380 

tgtgatcccc attgctcccc agttctgcct tctggattca catggggaca gggcagctcc 214440 

aggaaatgaa aggagttggg cctttgctca gccagcttcc tctagccacg ctctccttag 214500 

ctctgtttct cccttgggta ggaaactgct cctgtctagg gttctgatgg tactgggact 214560 

ccaggctcag gagggctggc caggacctac gactttcagg gcttggtctg gggttttagc 214620 

attcattcag ccaggtcttc agtatgggac cagaaaaaag gggatgtgag aacagggcta 214680 

gggaaggggt tatatgggcc cagctggtcc aggaatgaat ccatgccttg ccttggtacc 214740 

cctaaccaca gcgtttgtgc cttcagccgg ggaggcagcc cttgggacca gcatccctag 214800 

ggacaggagg cagcgggaat catctctgta tctcgggttc tgcccagggg atgggcagac 214860 

tctgccatct cttgagtgtt cgtttggaga agcctgagat gtggcccctg ctgccttctc 214920 

actagttgca gtctatgtaa ataaggtcaa taaattcttt ggaagagcca cggagctgag 214980 

tgaggctgtg ttgtgttttg ctttgcctag gctgggctca ggcagctctg cctcagcctc 215040 

ccaaggagct ggggaactgg tatatgtcac tgtatatgtc ■ actgtgcctg gcttatggct 215100 

tggcttggct ttttttcaga tggtctcaag tgcctcaggt tggccttgat cttgggatga 215160 

ccttcctgct tgaaacagag tagtgggctt ataggcatga cccaccaggt ccaattttta 215220 

ttttttaaag gcattgattt ttatacgtgt atggttgttt tgcccacttg tacatatgca 215280 

caccatactt gtgtctggtc cctgcggagg tcagaagagg gcatcgggat cacctggaac 215340 

cgaagttaat gaatggttat gagccacatc tcgatgctga agattgaacc tggatccttt 215400 

gcaagagcag ccagtgttct tacccactga gccatctcta agccccacac ccagcttctt 215460 

ttgatacaag gtctggtagc tcaaacttga tatgcagccg aggaggttga cctggtattc 215520 

cctacctacc ctcttctctc taccttccaa gtgctgatat tatacatagg catggatagt 215580 

catgcccacc agtttgcctt gatggcacca gagtcaggaa agtccaaacc tggtagttgc 215640 

aaacacagca agagggtaga ggcagccatt gtcctctggc tgccttggat acagagcttc 215700 

tgggttgggt ggccttgggt cagttttccg aatggttcac ccttggggaa agggaacact 215760 

gctgaagagg tgggaccctg ggagggccgg cctccagctg ggtctctcca gccctcgcct 215820 

tggaacctag gctggaggga gccaaccagg atcctggact tgctacagtt aggtgaacag 215880 

gctcctgcag cctccccttc ccttgggtag ctgtggtggt ggtggtggtg gtggtggtgg 215940 

tggtggtggt ggtggtggtg gtgggggggg gggngnngnt 215980 



<210> 17 
<211> 473 
<212> PRT 
<213> Mus sp. 

<400> 17 

Met Lys Arg Ala Ser Ser Gly Gly Ser Arg Leu Leu Ala Trp Val Leu 
15 10 15 

Trp Leu Gin Ala Trp Arg Val Ala Thr Pro Cys Pro Gly Ala Cys Val 
20 25 - 30 

Cys Tyr Asn Glu Pro Lys Val Thr Thr Ser Cys Pro Gin Gin Gly Leu 
35 40 45 

Gin Ala Val Pro Thr Gly lie Pro Ala Ser Ser Gin Arg He Phe Leu 
50 55 60 

His Gly Asn Arg He Ser His Val Pro Ala Ala Ser Phe Gin Ser Cys 
65 70 75 80 



125 



WO 02/29059 



PCT/US01/31488 



Arg Asn Leu Thr He Leu Trp Leu His Ser Asn Ala Leu Ala Arg He 
85 90 95 

Asp Ala Ala Ala Phe Thr Gly Leu Thr Leu Leu Glu Gin Leu Asp Leu 
100 105 110 

Ser Asp Asn Ala Gin Leu His Val Val Asp Pro Thr Thr Phe His Gly 
115 120 125 

Leu Gly His Leu His Thr Leu His Leu Asp Arg Cys Gly Leu Arg Glu 
130 135 140 

Leu Gly Pro Gly Leu Phe Arg Gly Leu Ala Ala Leu Gin Tyr Leu Tyr 
145 150 155 160 

Leu Gin Asp Asn Asn Leu Gin Ala Leu Pro Asp Asn Thr Phe Arg Asp 
165 170 175 

Leu Gly Asn Leu Thr His Leu Phe Leu His Gly Asn Arg He Pro Ser 
180 185 190 

Val Pro Glu His Ala Phe Arg Gly Leu His Ser Leu Asp Arg Leu Leu 
195 200 205 

Leu His Gin Asn His Val Ala Arg Val His Pro His Ala Phe Arg Asp 
210 215 " 220 

Leu Gly Arg Leu Met Thr Leu Tyr Leu Phe Ala Asn Asn Leu Ser Met 
225 ~ 230 ~ 235* 240 

Leu Pro Ala Glu Val Leu Met Pro Leu Arg Ser Leu Gin Tyr Leu Arg 
245 250 255 

Leu Asn Asp Asn Pro Trp Val Cys Asp Cys Arg Ala Arg Pro Leu Trp 
260 265 270 

Ala Trp Leu Gin Lys Phe Arg Gly Ser Ser Ser Glu Val Pro Cys Asn 
275 280 285 

Leu Pro Gin Arg Leu Ala Asp Arg Asp Leu Lys Arg Leu Ala Ala Ser 
290 295 300 

Asp Leu Glu Gly Cys Ala Val Ala Ser Gly Pro Phe Arg Pro He Gin 
305 310 315 ^ 320 

Thr Ser Gin Leu Thr Asp Glu Glu Leu Leu Ser Leu Pro Lys Cys Cys 
325 330 335 

Gin Pro Asp Ala Ala Asp Lys Ala Ser Val Leu Glu Pro Gly Arg Pro 
340 345 350 

Ala Ser Ala Gly Asn Ala Leu Lys Gly Arg Val Pro Pro Gly Asp Thr 
355 360 . " 365 

Pro Pro Gly Asn Gly Ser Gly Pro Arg His He Asn Asp Ser Pro Phe 
370 375 380 

Gly Thr Leu Pro Ser Ser Ala Glu Pro Pro Leu Thr Ala Leu Arg Pro 
385 390 395 400 

Gly Gly Ser Glu Pro Pro Gly Leu Pro Thr Thr Gly Pro Arg Arg Arg 
405 " 410 415 



Pro Gly Cys Ser Arg Lys Asn Arg Thr Arg Ser His Cys Arg Leu Gly 



126 



WO 02/29059 



PCT/US01/31488 



420 425 430 

Gin Ala Gly Ser Gly Ala Ser Gly Thr Gly Asp Ala Glu Gly Ser Gly 
435 440 445 

Ala Leu Pro Ala Leu Ala Cys Ser Leu Ala Pro Leu Gly Leu Ala Leu 
450 455 460 

Val Leu Trp Thr Val Leu Gly Pro Cys 
465 * 470 
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MSWQSGTTVT QSPVQAAQVS GCCVELLLLL LAGELPLGGG CPRDCVCYPA 



MLPGLRRLLQ APASACLLLM LLALPLAAPS CPMLCTCYSS 
CP — C-CY — 



5 1 



LRR NT 



LRR 1 



PKVTTSCPQQ GLQAVPVGIP 
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2 ( 57 ) Abstract: The invention relates generally to genes that encode proteins that inhibit axonal growth. The invention relates specif- 
ically to genes encodign NgR protein homologs in humans and mice. The invention also includes compositions and methods for 
^ modulating the expression and activity of Nogo and the NgR proteins. Specifically, the invention includes peptides, proteins and 
£^ antibodies that block Nogo-mediated inhibition of axonal extension. The compositions and methods of the invention are useful in 
^ the treatment of cranial o cerebral trauma, sprial cord injury, stroke or a demyelinating disease. 
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